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(57) Abstract: The present invention discloses the identification of a 
novel histamine receptor, termed H4. Amino acid sequences, nucleic acid 

sequences, vectors, and host cells are also discussed. Additionally, meth- 
ods of detecting agonists and antagonists for the receptor are disclosed 
herein. 
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HUMAN HISTAMINE H4 RECEPTOR 
5 This application claims priority under 35 U.S.C. § 1 19 from provisional patent 

application Serial Nos. 60/202,151, filed May 5, 2000, 60/227,567, filed August 23, 2000, and 
60/247,855, filed November 13, 2000; which all are hereby incorporated by reference in their 
entireties. 

10 FIELD OF THE INVENTION 

The present invention discloses the identification of a novel histamine receptor, 

termed H4. 

BACKGROUND OF THE INVENTION 
15 Histamine produces numerous physiological effects in the body through 

interaction with one of three different cell surface receptors, classified as H,, Hj, and H3. 

These receptors belong to the guanine nucleotide binding protein coupled receptor (G-protein 

coupled receptors, GPCR) class. 

Stimulation of histamine receptors produces symptoms that are typically 
20 associated with physiological responses to allergic stimuli (Ash and Schild, Br. J. Pharmacol. 

1966, 27:427). These effects are blocked by Hj antagonists such as, for example, 

diphenhydramine. Hj antagonists are generally defined as "classical antihistamines". Classical 

antihistamines are the active ingredient in most over-the-coimter allergy medications. 

Pharmacological studies indicate that agonist activation of these receptors stimulates the inositol 
25 phosphate pathway, and thus stimulates formation of inositol triphosphate (IP3) and 

diacylglycerol (DAG), 

Histamine Hj receptors have been shown to play a role in gastric acid secretions 

(Black et aL, Nature 1972, 236:385). Histamine Hj receptor amtagonists such as, for example, 

cimetidine and ranitidine, are often the active ingredient in over-the-coxmter and prescription 
30 drugs that are used to treat duodenal ulcers, gastric ulcers, heartburn, indigestion, and other 

disorders of the gastrointestinal tract. Activation of histamine Hj stimulates adenylyl cyclase 

activity and stimulates formation of cAMP. 
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Histamine H3 receptors are a relatively new member of the histamine receptor 
class. These receptors were originally described as histamine autoreceptors. These receptors 
were shown to be located on presynaptic histamine nerve terminals and to control the rate of 
histamme synthesis and release in the brain (Arrang etal , Nature 1 983, 302:832). Recent studies 
indicate that H3 receptors also are present on non-histamine presynaptic nerve terminals. H3 
receptors may modulate the synthesis and release of other neurotransmitters such as dopamine, 
serotonin, acetylcholine, and norepinephrine. H3 receptors also have been found in peripheral 
tissues. Histamine activation of H3 receptor inhibits adenylyl cyclase activity and formation of 
cAMP. 

Multiple pharmacological studies have indicated the presence of H3 receptor 
subtypes (Leurs et al. JPET 1996, 276:1009-1015; Gumming and Gjede, Brain Res. 1994, 
641:203-207; Calpham and Kilpatrick, Br. J. Pharmacol 1996, 107:919-923; Schworer et al 
Naunyn-Schmiedeberg's Arch. Pharmacol. 1994, 350:375-379; Schkicker et al, Naunyn- 
Schmiedeberg's Arch. Pharmacol. 1996, 353:482-488). Additionally, pharmacological 
characterization of a histamine receptor on eosinophils describes the greater potency of histamine 
compared to R-a-methylhistamine(Raible, etal Am. J. Respir. Grit. Care.Med 1994 , 149: 1506- 
1511.). However, the existence of these receptor subtypes has yet to be substantiated by 
molecular biological techniques. 

SUMMARY OF THE TNVENTTON 
The present invention contemplates an isolated histamine H4 receptor protein 
having an amino acid sequence at least 51% identical or comprising at least 10 contiguous amino 
acids from the sequence depicted m SEQ ID NO:2. The H4 receptor protein binds ligands 
comprising an imidazole and an anune, which imidazole and amine are attached by an alkyl 
Cham, where the rank order of efiScacy of modulation of second messenger formation of the 
Ugands at the H4 receptor protein is 5>6=10>8=4, where the number represents the number of 
carbons in the alkyl chain. In one embodiment, upon binding histamine or a histamine agonist 
the receptor protein inhibits second messenger formation. Preferably the second messenger is 
cAMP. 

The present application also discloses an isolated receptor protein having an 
amino acid sequence with at least 95% sequence identity to human H^ receptor protein having 



15 



25 



30 
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an amino acid sequence as depicted in SEQ ID NO: 2. In a specific embodiment, the protein is 
a human H4 receptor protein, e.g., having an amino acid sequence as depicted in SEQ ID NO:2 
or an allelic variant of that sequence. 

Also provided are nucleic acids, e.g., cDNAs encoding the H4 receptor proteins 
5 as discussed above. In another embodiment, an isolated nucleic acid encoding an H4 receptor 
protein, which nucleic acid hybridizes under stringent conditions to a nucleic acid having a 
sequence of at least 20 nucleotides identical to a corresponding nucleotide sequence of the same 
number of bases in SEQ ID NO: l or its complement. 

The isolated nucleic acids encoding H4 receptors can be part of vectors, e.g., for 
10 cloning, expression, and/or expansion. An expression vector comprises the nucleic acid 
encoding the H4 receptor protein operably associated with an expression control sequence. The 
invention further provides host cells and non-human transgenic animals containing such an 
expressible vector, and methods for producing an H4 receptor polypeptide using such host cells. 

In addition, the invention provides an isolated nucleic acid, such as a primer or 
1 5 probe, of at least 1 0 bases having a nucleotide sequence identical to a corresponding nucleotide 
sequence of the same number of bases in SEQ ID NO: 1 , or its complement. The invention also 
provides an antibody that specifically binds an H4 receptor. 

The present invention further contemplates a method for detecting expression of 
H4 receptor, which method comprises detecting mRNA encoding H4 receptor in a sample from 
20 a cell suspected of expressing H4 receptor or detecting the H4 receptor protein with an antibody 
of the invention. 

The present invention also contemplates an assay system for identifying H4 
receptor ligands. The assay system comprises a sxifficient number of transformed host cells to 
be able to detect an alteration in second messenger accumulation. Preferably, the second 

25 messenger is cAMP. 

The present invention also contemplates a method for identifying a test compound 
that antagonizes or agonizes histamine H4 receptors. The method comprises detecting an 
alteration in the level of a second messenger in the assay system contacted with the test 
compound. In the method, an increase in the level of the second messenger indicates that the test 

30 compound antagonizes the H4 receptor, A decrease in the level of the second messenger 
indicates that the test compound agonizes the H4 receptor. 
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The present invention also discloses an isolated nucleic acid that specifically 
hybridizes under highly stringent conditions to the complement of the sequence depicted in SEQ 
ID NO: 1 , where the nucleic acid encodes a histamine H4 receptor protein. 

BRIEF DESCRIPTION OF DRAWINGS 

Figure 1. Sequence of the human H4 cDNA (SEQ ID NO: 1) and the deduced 
amino acid sequence of the protein it encodes (SEQ ID NO:2). The TM domains are denoted by 
lines. 

Figures 2A, 2B, and 2C. (A) Agonist assay. Square: histamine; triangle: R-a- 
methylhistamine; inverted triangle: clobenpropit; diamond: thioperamide. (B) Antagonist assay. 
Asfsay were conducted in duplicate and presented as the average ± SEM. (C) Activity in 
mammalian cells. 

Figure 3. Antagonist activity of a series of histamine analogs. Square: 
clobenpropit; triangle: thioperamide. Assay were conducted in duplicate and presented as the 
average ± SEM.' 

Figure 4. H4 expression in from 6 human T cell clones derived from a single 
human donor. Three of these clones were CD4+ cell clones (RG4.3B, RG4.3A, and RG4.3), 
two were CD8+ cell clones (RG8.1C and RG8.1A), and one was anNKT cell clone (RGl). 

Figure 5. H4 expression in bulk populations of CD4+ and CD8+ T cells isolated 
from three different individuals. B-CD8 and B-CD4 were CD8H- and CD4+ cells, respectively, 
obtained from individual 1. L-CD8 and L-CD4 were CD8+ and CD4+ cells, respectively, 
obtained from individual 2. U-CD8 and U-CD4 were CD8+ and CD4+ cells, respectively, 
obtained from individual 3, 

DETAILED DESCRIPTION 
The present invention is based, in part, on discovery of a novel histamine receptor, 
which has been termed H4. The new histamine receptor was cloned from a human heart library 
using H3 specific primers, but proved to be a new histamine receptor. Yeast and human 
endothelial kidney (HEK) cells were transformed with a human H4 expression vector. 
Transformed cells in multi-well plates were treated with test compoxmds, and regulation of the 
intracellular second messenger cyclic adenosine monophosphate (cAMP) formation was 
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determined. Modulation of cAMP formation is ligand and concentration dependent. 

The nucleic acid and protein sequences of H4 shows homology to known G- 
protein coupled receptors. Specifically, the H4 protein shows homology to biogenic-amine G- 
protein coupled receptors. As in other biogenic amine receptors, the present protein contains 
5 conserved aspartate residues within transmembrane (TM) domains 2 and 3 (at positions 61 and 
94). A DY motif (D94 and Y95) in TM3 is found only in histamine and muscarinic receptors. 
A DRY sequence motif that exists at the cjrtoplasmic interface of TM3 of all GPCRs also can be 
found in the present sequence (amino acids 111-113). Also present are two conserved cysteine 
residues in the first and second extracellular loops (position 88 and 164), which are predicted 

10 to form a disulfide bond; two conserved tryptophan residues in TM4 and TM6 (positions 1 40 and 
316); and conserved proline residues in TM5, TM6, and TM7 (positions 186, 318, and 355, 
respectively). The putative TM domains are shown in Figure 1. H4 shares sequence identity 
with human H3 receptors. Sequence comparison indicates an overall sequence identity of about 
44% and overall sequence similarity of about 51%. 

15 The present invention also contemplates an assay method and system for 

identifying selective H4 receptor ligands. The method involves detecting binding of a test 
compound to isolated cell membranes containing the histamine H4 receptor. The assay system 
comprises transformed host cells that express H4 receptors, where the number of cells in the assay 
system is sufficient to detect an alteration in second messenger accumulation. The test system 

20 also includes an appropriate cell culture medium to permit cell growth and viability, and 
preferably tissue culture plates or arrays containing the host cells in cell culture medium, hi a 
specific embodiment, the second messenger that is detected is cAMP. In a further embodiment, 
the receptor is a human receptor. 

The invention also discloses a method for identifying a test compoxmd that 

25 antagonizes or agonizes histamine H4 receptors. The method comprises detecting an increase 
(antagonist) or decrease (agonist) in the level of a second messenger in the assay system when 
contacted with the test compound. 

Thus, the present invention advantageously provides H4 protein, including 
fragments, derivatives, and analogs of H4; H4 nucleic acids, including oligonucleotide primers 

30 and probes, and H4 regulatory sequences (especially an H4 primer and splice sites with introns); 
H4-specific antibodies; and related methods of using these materials to detect the presence of H4 
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proteins or nucleic acids, binding partners, and in screens for agonists and antagonists of H^. 

The following sections of the application, which are delineated by headings (in 
bold) and sub-headings (in bold italics), which cover these three aspects of the invention, are 
5 provided for clarity, and not by way of limitation. 

General Definitions 
As used herein, the term "isolated" means that the referenced material is removed 
from the environment in which it is normally found. Thus, an isolated biological material can 
be free of cellular components, i.e., components of the cells in which the material is foimd or 

10 produced in nature. In the case of nucleic acid molecules, an isolated nucleic acid includes a 
PGR product, an isolated mRNA, a cDNA, or a restriction fragment. In another embodiment, 
an isolated nucleic acid is preferably excised from the chromosome in which it may be found, 
and more preferably is no longer joined to non-regulatoiy, non-coding regions, or to other genes, 
located upstream or downstream of the gene contained by the isolated nucleic acid molecule 

1 5 when found in the chromosome. In yet another embodiment, the isolated nucleic acid lacks one 
or more introns. Isolated nucleic acid molecules include sequences inserted into plasmids, 
cosmids, artificial chromosomes, and the like. Thus, in a specific embodiment, a recombinant 
nucleic acid is an isolated nucleic acid. An isolated protein may be associated with other proteins 
or nucleic acids, or both, with which it associates in the cell, or with cellular membranes if it is 

20 a membrane-associated protein, A protein expressed from a vector in a cell, particularly a cell, 
in which the protein is normally not expressed is also a regarded as isolated. An isolated 
organelle, cell, or tissue is removed from the anatomical site in which it is found in a cell or an 
organism. An isolated material may be, but need not be, purified. 

The term "purified" as used herein refers to material that has been isolated under 

25 conditions that reduce or eliminate the presence of unrelated materials, i.e., contaminants, 
including native materials from which the material is obtained. For example, a purified protein 
is preferably substantially free of other proteins or nucleic acids with which it is associated in a 
cell; a purified nucleic acid molecule is preferably substantially free of proteins or other unrelated 
nucleic acid molecules with which it can be found within a cell. As used herein, the term 

30 "substantially free" is used operationally, in the context of analytical testing of the material. 
Preferably, purified material substantially free of contanunants is at least 50% pure; more 
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preferably, at least 90% pure; and more preferably still at least 99% pure. Purity can be evaluated 
by chromatography, gel electrophoresis, immunoassay, composition analysis, biological assay, 
and other methods known in the art. 

Methods for purification are well-known in the art. For example, nucleic acids 
can be purified by precipitation, chromatography (including preparative solid phase 
chromatography, oligonucleotide hybridization, and triple helix chromatography), 
ultracentrifiigation, and other means. Polypeptides and proteins can be purified by various 
methods including, without limitation, preparative disc-gel electrophoresis, isoelectric focusing, 
HPLC, reversed-phase HPLC, gel filtration, ion exchange and partition chromatography, 
precipitation and salting-out chromatography, extraction, and countercurrent distribution. For 
some purposes, it is preferable to produce the polypeptide in a recombinant system in which the 
protein contains an additional sequence tag that facilitates purification, such as, but not limited 
to, a polyhistidine sequence, or a sequence that specifically binds to an antibody, such as FLAG 
and GST. The polypeptide can then be purified fi*om a crude lysate of the host cell by 
chromatography on an appropriate solid-phase matrix. Alternatively, antibodies produced against 
the protein or against peptides derived therefi"om can be used as purification reagents. Cells can 
be purified by various techniques, including centrifiigation, matrix separation (e.g., nylon wool 
separation), paiming and other immunoselection techniques, depletion {e.g., complement 
depletion of contaminating cells), and cell sorting (e,g,, fluorescence activated cell sorting 
[FACS]). Other purification methods are possible. A purified material may contain less than 
about 50%, preferably less than about 75%, and most preferably less than about 90%, of the 
cellular components with which it was originally associated. The "substantially pure" indicates 
the highest degree of purity which can be achieved using conventional purification techniques 
known in the art. 

In a specific embodiment, the term "about" or "approximately" means within a 
scientifically acceptable error range for a given value relative to the precision with which the 
value is or can be measured, e.g., within 20%, preferably within 1 0%, and more preferably within 
5% of a given value or range. Alternatively, particularly with biological systems, the term can 
mean within an order of magnitude, preferably within 5-fold and more preferably within 2-fold 
of a given value. 

A "sample" as used herein refers to a biological material which can be tested for 
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the presence of H4 protein or H4 nucleic acids. Such samples can be obtained from animal 
subjects, such as humans and non-human animals, and include tissue, especially muscle, 
biopsies, blood and blood products; plural effusions; cerebrospinal fluid (CSF); ascites fluid; and 
cell culture. 

Non-human animals include, without limitation, laboratory animals such as mice, 
rats, rabbits, hamsters, guinea pigs, etc.; domestic animals such as dogs and cats; and, farm 
animals such as sheep, goats, pigs, horses, and cows. 

The use of italics indicates a nucleic acid molecule; normal text indicates the 
polypeptide or protein. 

The term "Ugand" refers to a compound that recognizes and binds to a receptor 
binding site. In a specific embodiment, the ligand binds to the histamine H4 receptors of the 
invention. Upon binding to the receptor, the ligand may produce agonist or antagonist functional 
effects. 

The term "agonist" refers to a ligand that binds to the receptor and produces a 
functional effect similar to that produced by the endogenous ligand for the receptor. In sl specific 
embodiment, the agonist at the histamine H4 receptor produces an effect similar to that produced 
by histamine, the endogenous ligand (histamine) for the H4 receptor. Examples of such agonists 
include, but are not limited to, R-a-methyl histamine and imetit. 

The term "antagonist" refers to a ligand that binds to the receptor and blocks a 
functional effect produced by an agonist for the receptor or the endogenous ligand of the receptor. 
Examples of such antagonists include, but are not limited to, thioperamide. 

The term "selective" refers to the ability of a histamine H4 agonist or antagonist 
to elicit a response from the H4 receptor while eliciting minimal responses from another receptor. 
Stated differently, a selective H4 agonist may be a potent agonist for the H4 receptor while 
agonizing another receptor, such as another G-protein coupled receptor and particularly another 
histamine receptor, poorly or not at all. 

The term "ability to elicit a response" refers to the ability of a H4 agonist or 
antagonist ligand to agonize or antagonize H4 receptor activity. 

As used herein the term "transformed cell" refers to a modified host cell that 
expresses a functional H4 receptor expressed from a vector encoding the histamine receptor. 
Any cell can be used, preferably a mammalian cell, and more preferably a HEK cell. 
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A "functional histamine receptor" is a receptor that binds histamine or H4 agonists 
and transduces a signal upon such binding. Preferably the H4 receptor is a human H4. Preferably, 
the signal that is transduced is accumulation of a second messenger, preferably cAMP. 
Histamine H4 receptors may be derived from a variety of sources, including manunal, e.g., 
human, bovine, porcine, and canine; and avian. 

The cells of the invention are particularly suitable for an assay system for 
histamine H4 receptor ligands that modulate second messenger accumulation. An "assay system" 
is one or more collections of such cells, e.g. , in a micro well plate or some other culture system. 
To permit evaluation of the effects of a test compound on the cells, the number of cells in a single 
assay system is sufficient to express a detectable amoimt of the regulated second messenger 
accumulation at least under conditions of maximimi second messenger accumulation. 

A "second messenger" is an intracellular molecule or ion, where formation and/or 
accumulation of the second messenger is regulated by activation of cellular membranes. In one 
embodiment, cellular membranes contain G-protein coupled receptor, ion channels, and tyroinse 
kinase receptors. In the context of this invention, the cellular membrane is a G-protein coupled 
receptor, preferably a histamine H4 receptor. In a specific embodiment, the second messenger 
is one or more of cAMP, cGMP, inositol phosphate, DAG, and ions such as calcium and 
potassium. Preferably, the second messenger is cAMP. 

A "test compovmd" or "candidate compound" is any molecule that can be tested 
for its ability to bind H4 receptors, and preferably modulate second messenger accumulation 
through the H4 receptor, as set forth herein. A compound that binds, and preferably modulates 
H4 is a "lead compound" suitable for further testing and development as an H4 agonist or 
antagonist- 

As used herein, the term "provide" refers to supplying the compoxmds or 
pharmaceutical compositions of the present invention to cells or to an animal, preferably a 
human, in any form. For example, a prodrug form of the compounds may be provided the subject, 
which then is metabolized to the compound in the body. 
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Receptor 

H4 receptor, as defined herein, refers to a polypeptide having about 390 amino 

acids. 

The protein is transcribed firom a nucleic acid sequence that is about 1 173 base pairs in length. 
5 The H4 protein has significant homology to the H3 receptor. Thus, H4 refers to orthologs and 
allelic variants, e.g., a protein having greater than about 50%, preferably greater than 80%, more 
preferably still greater than 90%, and even more preferably greater than 95% overall sequence 
identity to SEQ ID NO: 2. Allelic variants may differ fi-om 1 to about 5 amino residues firom 
SEQ ED NO:2. In a specific embodiment, H4 has an amino acid sequence as shown in SEQ ID 
10 NO:2. 

Sequence comparison studies between human H4 protein and human H3 protein, 
indicates sequence identity of about 44%. The predicted protein sequence contains residues 
which are characteristic of the class of biogenic amine receptors. As in other biogenic amine 
receptors, the present protein contains conserved aspartate residues within TM2 and TM3 (at 

15 positions 61 and 94). A DY motif, found only in histamine and muscarinic receptors, is found 
at positions 94-95. A DRY sequence motif that exists at the cytoplasmic interface of TM3 of all 
GPCRs also can be found in amino acids 111-113. Also present are two conserved cysteine 
residues in the first and second extracellular loops (position 88 and 164), two conserved 
tryptophan residues in TM4 and TM6 (positions 140 and 316); and conserved proline residues 

20 in TM5, TM6, and TM7 (positions 186, 3 18, and 355, respectively). 

H4 receptors, like and H3 receptors, modulates adenylyl cyclase activity. 
Therefore, the receptor modulates accumulation of the intracellular messenger cAMP. 
Modulation of H4 receptors maybe a treatment for transplanted organ rejection, asthma, allergies, 
and autoimmune pathologies, such as, but not limited to, multiple sclerosis, type I diabetes, and 

25 rheumatoid arthritis. Additionally, CNS fimctions such as cognitive and memory defects may 
be treated. 

H4 fi-agments, derivatives, and analogs can be characterized by one or more of 
the characteristics of H4 protein. In a specific embodiment, in order to develop the specific 
C-terminal and N-terminal H4 antibodies, antibodies can be raised against extracellular or 
30 cytoplasmic portions of the H4 protein, preferably or antigenic peptides identified using a 
hydrophobicity profile or other algorithms. 
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Analogs and derivatives of the H4 receptor of the invention have the same or 
homologous characteristics of H4 as set forth above. For example, a truncated form of H4 can be 
provided. Such a truncated form includes H4 with a either an N-terminal, C-terminal, or internal 
deletion. In a specific embodiment, the derivative is functionally active, /.e, capable of 
5 exhibiting one or more functional activities associated with a full-length, wild-type H4 of the 
invention. Such functions include, but are not limited to, inhibition of adenylyl cyclase activity 
and cAMP formation. Altematively, a H4 chimeric fusion protein can be prepared in which the 
H4 portion of the fusion protein has one or more characteristics of H4. Such fusion proteins 
include fusions of the H4 receptor with a marker polypeptide, such as FLAG, a histidine tag, a 

1 0 myc tag, or glutathione-S-transferase (GST). Alternatively, the H4 receptor can be fused with an 
expression-related peptide, such as yeast a-mating factor, a heterogeneous signal peptide, or a 
peptide that renders the protein more stable upon expression. H4 can also be fused with a unique 
phosphorylation site for labeling. 

H4 analogs can be made by altering encoding nucleic acid sequences by 

15 substitutions, additions or deletions that provide for functionally similar molecules, z.e., 
molecules that perform one or more H4 functions. In a specific embodiment, an analog of H4 is 
a sequence-conservative variant of H4. In another embodiment, an analog of H4 is a function- 
conservative variant. In yet another embodiment, an analog of H4 is an allelic variant or a 
homologous variant from another species. In a specific embodiment, human variants of H4 are 

20 described. 

H4 derivatives include, but are by no means limited to, phosphorylated H4, 
glycosylated H4, methylated H4, and other H4 proteins that are otherwise chemically modified. 
H4 derivatives also include labeled variants, e.g., radio-labeled with iodine (or, as pointed out 
above, phosphorous); a detectable molecule, such as but by no means limited to biotin, a 
25 chelating group complexed with a metal ion, a chromophore"br fluorophore, a gold colloid, or 
a particle such as a latex bead; or attached to a water soluble polymer, 

Clommg and ExpressioB of 
The present invention contemplates analysis and isolation of a gene encoding a 
30 functional or mutant H4, including a full length, or naturally occurring form of H4, and any 
antigenic fragments thereof from any source, preferably human. It further contemplates 
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. expression of functional or mutant H4 protein for evaluation, diagnosis, or therapy. 

In accordance with the present invention there may be employed conventional 
molecular biology, microbiology, and recombinant DNA techniques within the skill of the art. 
Such techniques are explained fully in the literature. See, e.g., Sambrook, Fritsch & Maniatis, 
Molecular Cloning: A Laboratory Manual, Second Edition (1989) Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (herein "Sambrook et al, 1989"); DNA 
Cloning: A Practical Approach, Volumes I and 11 (D.N. Glover ed. 1985); Oligonucleotide 
Synthesis (MJ. Gait ed. 1984); Nucleic Acid Hybridization [B.D. Hames & S.J. Higgins eds. 
(1985)]; Transcription And Translation [B.D. Hames & S.J. Higgins, eds. (1984)]; Animal Cell 
Culture [R.I. Freshney, ed. (1986)]; Immobilized Cells And Enzymes [IRL Press, (1986)]; 
B.Perbal, A Practical Guide To Molecular Cloning (1984); P.M. Ausubel et al (eds.), Current 
Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994). 

Molecular Biology - Definitions 

"Amplification" of DNA as used herein denotes the use of polymerase chain 
reaction (PCR) to increase the concentration of a particular DNA sequence within a mixture of 
DNA sequences. For a description of PCR see Saiki et al. Science, 239:487, 1988. 

As used herein, "sequence-specific oligonucleotides" refers to related sets of 
oligonucleotides that can be used to detect allelic variations or mutations in the H4 gene. 

The nucleic acid molecules (polynucleotides) herein may be flanked by natural 
regulatory (expression control) sequences, or may be associated with heterologous sequences, 
including promoters, intemal ribosome entry sites (IRES) and other ribosome binding site 
sequences, enhancers, response elements, suppressors, signal sequences, polyadenylation 
sequences, introns, 5 - and 3 - non-coding regions, and the like. The nucleic acids may also be 
modified by many means known in the art. Non-limiting examples of such modifications include 
methylation, "caps", substitution of one or more of the naturally occurring nucleotides with an 
analog, and intemucleotide modifications such as, for example, those with uncharged linkages 
{e.g,, methyl phosphonates, phosphotriesters, phosphoroamidates, carbamates, etc.) and with 
charged linkages {e.g., phosphorothioates, phosphorodithioates, etc.). Polynucleotides may 
contain one or more additional covalently linked moieties, such as, for example, proteins (e.g., 
nucleases, toxins, antibodies, signal peptides, poly-L-lysine, etc.), intercalators {e.g., acridine, 
psoralen, etc.), chelators {e.g., metals, radioactive metals, iron, oxidative metals, etc.), and 
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alkylators. The polynucleotides may be derivatized by formation of a methyl or ethyl 
phosphotriester or an alkyl phosphoramidate linkage. Furthermore, the polynucleotides herein 
may also be modified with a label capable of providing a detectable signal, either directly or 
indirectly. Exemplary labels include radioisotopes, fluorescent molecules, biotin, and the like. 

A "coding sequence" or a sequence "encoding" an expression product, such as a 
RNA, polypeptide, protein, or enzyme, is a nucleotide sequence that, when expressed, results in ' 
the production of that RNA, polypeptide, protein, or enzyme, le., the nucleotide sequence 
encodes an amino acid sequence for that polypeptide, protein or enzyme. A coding sequence for 
a protein may include a start codon (usually ATG) and a stop codon. 

The term "gene", also called a "stmctural gene" means a DNA sequence that codes 
for or corresponds to a particular sequence of amino acids which comprise all or part of one or 
more proteins or enzymes, and may or may not include introns and regulatory DNA sequences, 
such as promoter sequences, 5 -untranslated region, or 3 -untranslated region which affect for 
example the conditions under which the gene is expressed. 

A "promoter sequence" is a DNA regulatory region capable of binding RNA 
polymerase in a cell and initiating transcription of a downstream (3* direction) coding sequence. 
For purposes of defining the present invention, the promoter sequence is bounded at its 3* 
terminus by the transcription initiation site and extends upstream (5' direction) to include the 
minimum mmiber of bases or elements necessary to initiate transcription at levels detectable 
above background. Within the promoter sequence will be found a transcription initiation site 
(conveniently defined for example, by mapping with nuclease SI), as well as protein binding 
domains (consensus sequences) responsible for the binding of RNA polymerase. The present 
invention includes the H4 receptor gene promoter foxmd in the genome, which can be operatively 
associated with a H4 coding sequence with a heterologous coding sequence. 

The term "host cell" means any cell of any organism that is selected, modified, 
transformed, grown, or used or manipulated in any way, for the production of a substance by the 
cell, for example the expression by the cell of a gene, a DNA or RNA sequence, a protein or an 
enzyme. Host cells can further be used for screening or other assays, as described infra. 

A coding sequence is "under the control of or "operatively associated with" 
transcriptional and translational control sequences in a cell when RNA polymerase transcribes 
the coding sequence into mRNA, which is then trans-RNA spliced (if it contains introns) and 
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translated, in the case of mRNA, into the protein encoded by the coding sequence. 

The tenns "express" and "expression" mean allowing or causing the information 
in a gene or DNA sequence to become manifest, for example producing a protein by activating 
the cellular functions involved in transcription and translation of a corresponding gene or DNA 
5 sequence. A DNA sequence is expressed in or by a cell to form an "expression product" such 
as a protein. The expression product itself, e,g., the resulting protein, may also be said to be 
"expressed" by the cell. An expression product can be characterized as intracellular, 
extracellular, transmembrane, or secreted. The term "intracellular" means something that is 
inside a cell. The term "extracellular" means something that is outside a cell. The H4 receptor 
10 is a seven transmembrane protein with intracellular and extracellular domains. A substance is 
"secreted" by a cell if it appears in significant measure outside the cell, from somewhere on or 
inside the cell. 

The term "transfection" means the introduction of a foreign nucleic acid into a 
cell. The term "transformation" means the introduction of a "foreign" (Le., extrinsic or 

15 extracellular) gene, DNA or RNA sequence to a host cell, so that the host cell will express the 
introduced gene or sequence to produce a desired substance, typically a protein or enzyme coded 
by the introduced gene or sequence. The introduced gene or sequence may also be called a 
"cloned" or "foreign" gene or sequence, may include regulatory or control sequences, such as 
start, stop, promoter, signal, secretion, or other sequences used by a cell's genetic machinery, 

20 The gene or sequence may include nonfunctional sequences or sequences with no known 
function. A host cell that receives and expresses introduced DNA or RNA has been 
"transformed" and is a "transformant" or a "clone." The DNA or RNA introduced to a host cell 
can come from any source, including cells of the same genxis or species as the host cell, or cells 
of a different genus or species. 

25 The terms "vector*', "cloning vector" and "expression vector" mean the vehicle 

by which a DNA or RNA sequence (e.g., a foreign gene) can be introduced into a host cell, so 
as to transform the host and promote expression (e.g., transcription and translation) of the 
introduced sequence. Vectors include plasmids, phages, viruses, etc.; they are discussed in 
greater detail below. 

30 Vectors typically comprise the DNA of a transmissible agent, into which foreign 

DNA is inserted. A common way to insert one segment of DNA into another segment of DNA 
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involves the use of enzymes called restriction enzymes that cleave DNA at specific sites 
(specific groups of nucleotides) called restriction sites. A "cassette" refers to a DNA coding 
sequence or segment of DNA that codes for an expression product that can be inserted into a 
vector at defined restriction sites. The cassette restriction sites are designed to ensure insertion 
5 of the cassette in the proper reading frame. Generally, foreign DNA is inserted at one or more 
restriction sites of the vector DNA, and then is carried by the vector into a host cell along with 
the transmissible vector DNA. A segment or sequence of DNA having inserted or added DNA, 
such as an expression vector, can also be called a "DNA construct." A common type of vector 
is a "plasmid", which generally is a self-contained molecule of double-stranded DNA, usually 

10 of bacterial origin, that can readily accept additional (foreign) DNA and which can readily 
introduced into a suitable host cell. A plasmid vector often contains coding DNA and promoter 
DNA and has one or more restriction sites suitable for inserting foreign DNA. Coding DNA is 
a DNA sequence that encodes a particular amino acid sequence for a particular protein or 
enzyme. Promoter DNA is a DNA sequence which initiates, regulates, or otherwise mediates or 

1 5 controls the expression of the coding DNA. Promoter DNA and coding DNA may be from the 
same gene or from different genes, and may be from the same or different organisms. A large 
number of vectors, including plasmid and fungal vectors, have been described for replication 
and/or expression in a variety of eukaryotic and prokaryotic hosts. Non-limiting examples 
include pKK plasmids (Clonetech), pUC plasmids, pET plasmids (N ovagen. Inc., Madison, WI), 

20 pRSET or pREP plasmids (Invitrogen, San Diego, CA), or pMAL plasmids (New England 
Biolabs, Beverly, MA), and many appropriate host cells, using methods disclosed or cited herein 
or otherwise known to those skilled in the relevant art. Recombinant cloning vectors will often 
include one or more replication systems for cloning or expression, one or more markers for 
selection in the host, e.g., antibiotic resistance, and one or more expression cassettes. 

25 The term "expression system" means a host cell and compatible vector under 

suitable conditions, e,g,, for the expression of a protein coded for by foreign DNA carried by the 
vector and introduced to the host cell. Common expression systems include E. coli host cells and 
plasmid vectors, insect host cells and Baculovirus vectors, and mammalian host cells and vectors. 

30 The term "heterologous" refers to a combination of elements not naturally 

occurring. For example, heterologous DNA refers to DNA not naturally located in the cell, or 
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in a chromosomal site of the cell. Preferably, the heterologous DNA includes a gene foreign to 
the cell. A heterologous expression regulatory element is such an element operatively associated 
with a different gene than the one it is operatively associated with in nature. In the context of the 
present invention, an H4 gene is heterologous to the vector DNA in which it is inserted for 
cloning or expression, and it is heterologous to a host cell containing such a vector, in which it 
is expressed, e.g„ a HEK cell. 

The terms "mutant" and "mutation" mean any detectable change in genetic 
material, e,g., DNA, or any process, mechanism, or result of such a change. This includes gene 
mutations, in which the structure (e.g., DNA sequence) of a gene is altered, any gene or DNA 
arising from any mutation process, and any expression product (e.g., protein or enzyme) 
expressed by a modified gene or DNA sequence. The term "variant" may also be used to indicate 
a modified or altered gene, DNA sequence, enzyme, cell, etc., any kind of mutant. 

"Sequence-conservative variants" of a polynucleotide sequence are those in which 
a change of one or more nucleotides in a given codon position results in no alteration in the 
amino acid encoded at that position. 

"Function-conservative variants" are those in which a given amino acid residue 
in a protein or enzyme has been changed without altering the overall conformation and function 
of the polypeptide, including, but not limited to, replacement of an amino acid with one having 
similar properties (such as, for example, polarity, hydrogen bonding potential, acidic, basic, 
hydrophobic, aromatic, and the like). Amino acids with similar properties are well known in the 
art. For example, arginine, histidine and lysine are hydrophilic-basic amino acids and may be 
interchangeable. S imilarly, isoleucine, a hydrophobic amino acid, may be replaced with leucme, 
methionine or valine. Such changes are expected to have little or no effect on the apparent 
molecular weight or isoelectric point of the protein or polypeptide. Amino acids oflier than those 
indicated as conserved may differ in a protein or enzyme so that the percent protein or amino acid 
sequence similarity between any two proteins of similar function may vary and may be, for 
example, from 70% to 99% as determined according to an alignment scheme such as by the 
Cluster Method, wherein similarity is based on the MEGALIGN algorithm. A 
"fimction-conservative variant" also includes a polypeptide or enzyme which has at least 60 % 
amino acid identity as determined by BLAST or FASTA algorithms, preferably at least 75%, 
most preferably at least 85%, and even more preferably at least 90%, and which has the same or 
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substantially similar properties or functions as the native or parent protein or enzyme to which 
it is compared. 

As used herein, the term "homologous" in all its grammatical forms and spelling 
variations refers to the relationship between proteins that possess a "common evolutionary 
origin," including proteins from superfamilies {e.g,, the immunoglobulin superfamily) and 
homologous proteins from different species (e.g., myosin light chain, etc.) (Reeck et al. Cell 
1987, 50:667). Such proteins (and their encoding genes) have sequence homology, as reflected 
by their sequence similarity, whether in terms of percent similarity or the presence of specific 
residues or motifs at conserved positions. 

Accordingly, the term "sequence similarity" in all its grammatical forms refers to 
the degree of identity or correspondence between nucleic acid or amino acid sequences of 
proteins that may or may not share a common evolutionary origin (see Reeck et aL, supra). 
However, in common usage and in the instant application, the term "homologous," when 
modified with an adverb such as "highly," may refer to sequence similarity and may or may not 
relate to a common evolutionary origin. 

In a specific embodiment, two DNA sequences are "substantially homologous" 
or "substantially similar" when at least about 80%, and most preferably at least about 90 or 95% 
of the nucleotides match over the defined length of the DNA sequences, as determined by 
sequence comparison algorithnas, such as BLAST, FASTA, DNA Strider, etc. An example of 
such a sequence is an allelic or species variant of the specific H4 gene of the invention. 
Sequences that are substantially homologous can be identified by comparing the sequences using 
standard software available in sequence data banks, or in a Southern hybridization experiment 
xmder, for example, stringent conditions as defined for that particular system. 

Similarly, in a particular embodiment, two amino acid sequences are 
"substantially homologous" or "substantially similar" when greater than 80% of the amino acids 
are identical, or greater than about 90% are similar (functionally identical). Preferably, the 
similar or homologous sequences are identified by aligrmient using, for example, the GCG 
(Genetics Computer Group, Program Manual for the GCG Package, Version 7, Madison, 
Wisconsin) pileup program, or any of the programs described above (BLAST, FASTA, etc) 

A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, such 
as a cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic acid molecule 
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can anneal to the other nucleic acid molecule under the appropriate conditions of temperature and 
solution ionic strength (see Sambrook et aL, supra). The conditions of temperature and ionic 
strength determine the "stringency" of the hybridization. For preliminary screening for 
homologous nucleic acids, low stringency hybridization conditions, corresponding to a Tm 
5 (melting temperature) of 55^*0, can be used, e.g., 5x SSC, 0.1% SDS, 0.25% milk, and no 
formamide; or 30% formamide, 5x SSC, 0.5% SDS. Moderate stringency hybridization 
conditions correspond to a higher Tm, e.g., 40% formamide, with 5x or 6x SSC. High stringency 
hybridization conditions correspond to the highest Tm, e.g., 50% formamide, 5x or 6x SSC. SSC 
is a 0.15M NaCl, 0.01 5M Na-citrate buffer. Hybridization requires that the two nucleic acids 

10 contain complementary sequences, although depending on the stringency of the hybridization, 
mismatches between bases are possible. The appropriate stringency for hybridizing nucleic acids 
depends on the length of the nucleic acids and the degree of complementation, variables well 
known in the art. The greater the degree of similarity or homology between two nucleotide 
sequences, the greater the value of Tm for hybrids of nucleic acids having those sequences. The 

15 relative stability (corresponding to higher Tm) of nucleic acid hybridizations decreases in the 
following order: RNAiRNA, DNAiRNA, DNArDNA. For hybrids of greater than 100 
nucleotides in length, equations for calculating Tm have been derived (see Sambrook et al, 
supra^ 9.50-9.51). For hybridization with shorter nucleic acids, /.e., oligonucleotides, the 
position of mismatches becomes more important, and the length of the oligonucleotide 

20 determines its specificity (see Sambrook et al , supra, 11.7-1 1.8). A minimum length for a 
hybridizable nucleic acid is at least about 1 0 nucleotides; preferably at least about 1 5 nucleotides; 
and more preferably the. length is at least about 20 nucleotides. 

In a specific embodiment, the term "standard hybridization conditions" refers to 
a Tm of 55 ""C, and utilizes conditions as set forth above. In a preferred embodiment, the Tm is 

25 60 ""C; in a more preferred embodiment, the Tm is 65 °C. In a specific embodiment, "high 
stringency'' refers to hybridization and/or washing conditions at 68 °C in 0.2xSSC, at 42°C in 
50% formamide, 4xSSC, or under conditions that afford levels of hybridization equivalent to 
those observed under either of these two conditions. 

As used herein, the term "oligonucleotide" refers to a nucleic acid, generally of 

30 at least 1 0, preferably at least 1 5, and more preferably at least 20 nucleotides, preferably no more 
than 1 00 nucleotides, that is hybridizable to a genomic DNA molecule, a cDNA molecule, or an 
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niRNA molecule encoding a gene, mRNA, cDNA, or other nucleic acid of interest. 
Oligonucleotides can be labeled, e.g., with ^^P-nucleotides or nucleotides to which a label, such 
as biotin, has been covalently conjugated. In one embodiment, a labeled oligonucleotide can be 
' used as a probe to detect the presence of a nucleic acid. In another embodiment, oligonucleotides 

5 (one or both of which may be labeled) can be used as PGR primers, either for cloning full length 
or a fragment of H4, or to detect the presence of nucleic acids encoding H4- In a further 
embodiment, an oligonucleotide of the invention can form a triple helix with a H4 DNA 
molecule. Generally, oligonucleotides are prepared synthetically, preferably on a nucleic acid 
synthesizer. Accordingly, oligonucleotides can be prepared with non-naturally occurring 
1 0 phosphoester analog bonds, such as thioester bonds, etc. 

The present invention provides antisense nucleic acids (including ribozymes), 
which may be used to inhibit expression of H4 of the invention. Inhibition of H4 expression may 
be desired when upregulation of H4 receptor expression or excessive inhibition of cAMP 
formation induces disease states such as, transplant organ rejection; asthma; allergies; 
15 autoimmune pathologies, such as, but not limited to, multiple sclerosis, type I diabetes, and 
rheumatoid arthritis; and CNS functions such as cognitive and memory defects. An "antisense 
nucleic acid" is a single stranded nucleic acid molecule which, on hybridizing under cytoplasmic 
j conditions with complementary bases in an RNA or DNA molecule, inhibits the latter's role. If 

I the RNA is a messenger RNA transcript, the antisense nucleic acid is a countertranscript or 

I 20 mRNA-interfering complementary nucleic acid. As presently used, "antisense" broadly includes 

RNA-RNA interactions, RNA-DNA interactions, ribozymes and RNase-H mediated arrest. 
Antisense nucleic acid molecules can be encoded by a recombinant gene for expression in a cell 
(e.g., U.S. Patent No. 5,814,500; U.S. Patent No. 5,811,234), or alternatively they can be 
prepared synthetically (e.g„ U.S. Patent No. 5,780,607). 
25 Specific non-limiting examples of synthetic oligonucleotides envisioned for this 

invention include oligonucleotides that contain phosphorothioates, phosphotriesters, methyl 
phosphonates, short chain alkyl, or cycloalkyl intersugar linkages or short chain heteroatomic or 
heterocyclic intersugar linkages. Most preferred are those with CHj-NH-O-CHj, CH2-N(CH3)-0- 
CH2, CH2-0-N(CH3)-CH2, CH2-N(CH3)-N(CH3)-CH2 and 0-N(CH3)-CH2-CH2 backbones (where 
30 phosphodiester is O-POj-O-CHj). U.S. Patent No. 5,677,437 describes heteroaromatic 
olignucleoside linkages. Nitrogen linkers or groups containing nitrogen can also be used to 

i 

I 
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prepare oligonucleotide miniics (U.S. Patents No. 5,792,844 and No. 5,783,682). U.S. Patent 
No. 5,637,684 describes phosphoramidate and phosphorothioamidate oligomeric compounds. 
Also envisioned are oligonucleotides having morpholino backbone structures (U.S. Patent No. 
5,034,506). In other embodiments, such as the peptide-nucleic acid (PNA) backbone, the 

5 phosphodiester backbone of the oligonucleotide may be replaced with a polyamide backbone, 
the bases being bound directly or indirectly to the aza nitrogen atoms of the polyamide backbone 
(Nielsen et al. Science 254:1497, 1991). Other synthetic oligonucleotides may contain 
substituted sugar moieties comprising one of the following at the 2' position: OH, SH, SCH3, F, 
OCN, 0(CH2)„NH2 or 0(CH2)„CH3 where n is from 1 to about 10; CI to CIO lower alkyl, 

10 substituted lower alkyl, alkaryl or aralkyl; CI; Br; CN; CF3; OCF3; 0-; S-, or N-alkyl; 0-, S-, or 
N-alkenyl; SOCH3; SO2CH3; ONOj; NO^; N3; NH^; heterocycloalkyl; heterocycloalkaryl; 
aminoalkylamino; polyalkylamino; substitued silyl; a fluorescein moiety; an RNA cleaving 
group; a reporter group; an intercalator; a group for improving the pharmacokinetic properties 
of an oligonucleotide; or a group for improving the pharmacodynamic properties of an 

15 oligonucleotide, and other substituents having similar properties. Oligonucleotides may also 
have sugar mimetics such as cyclobutyls or other carbocyclics in place of the pentofuranosyl 
group. Nucleotide units having nucleosides other than adenosine, cytidine, guanosine, thymidine 
and uridine, such as inosine, may be used in an oligonucleotide molecule. 

Nucleic Acids 

20 A gene encoding H4, whether genomic DNA or cDNA, can be isolated from any 

source, particxilarly from a himaan cDNA or genomic library. Methods for obtaining H4 gene are 
well known in the art, as described above (see, e.g., Sambrook et aL, 1989, supra). The DNA 
may be obtained by standard procedures known in the art from cloned DNA (e.g., a DNA 
"library")* and preferably is obtained from a cDNA library prepared from tissues with high level 

25 expression of the protein, by chemical synthesis, by cDNA cloning, or by the cloning of genomic 
DNA, or fragments thereof, ptirified from the desired cell (See, for example, Sambrook et ah, 
1989, supra\ Glover, D.M. (ed.), 1985, DNA Cloning: A Practical Approach, MRL Press, Ltd., 
Oxford, U.K. Vol. I, II). Clones derived from genomic DNA may contain regulatory and intron 
DNA regions in addition to coding regions; clones derived from cDNA will not contain intron 

30 sequences. Whatever the source, the gene should be molecularly cloned into a suitable vector 
for propagation of the gene. Identification of the specific DNA fragment containing the desired 
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H4 gene may be accomplished in a number of ways. For example, a portion of a H4 gene 
exemplified infi-a can be purified and labeled to prepare a labeled probe, and the generated DN A 
may be screened by nucleic acid hybridization to the labeled probe (Benton and Davis, Science 
1977, 196:180; Grunstein and Hogness, Proc. Natl. Acad. Sci. U.S.A. 1975, 72:3961). Those 
DNA fragments with substantial homology to the probe, such as an allelic variant from another 
individual, will hybridize, hi a specific embodiment, highest stringency hybridization conditions 
are used to identify a homologous H4 gene. 

Further selection can be carried out on the basis of the properties of the gene, e.g., 
if the gene encodes a protein product having the isoelectric, electrophoretic, amino acid 
composition, partial or complete amino acid sequence, antibody binding activity, or ligand 
binding profile of H4 protein as disclosed herein. Thus, the presence of the gene may be detected 
by assays based on the physical, chemical, inmiunological, or functional properties of its 
expressed product. 

Other DNA sequences which encode substantially the same amino acid sequence 
as a H4 gene may be used in the practice of the present invention. These include but are not 
limited to allelic variants, species variants, sequence conservative variants, and ftinctional 
variants. 

Amino acid substitutions may also be introduced to substitute an amino acid with 
a particularly preferable property. For example, a Cys may be introduced a potential site for 
disulfide bridges with another Cys. 

The genes encoding H4 derivatives and analogs of the invention can be produced 
by various methods known in the art. The manipulations which result in their production can 
occur at the gene or protein level. For example, the cloned H4 gene sequence can be modified 
by any of nxxmerous strategies known in the art (Sambrook et al^ 1989, supra). The sequence 
can be cleaved at appropriate sites with restriction endonuclease(s), followed by further 
enzymatic modification if desired, isolated, and ligated in vitro, hi the production of the gene 
encoding a derivative or analog of H4, care should be taken to ensure that the modified gene 
remains within the same translational reading frame as the H4 gene, uninterrupted by translational 
stop signals, in the gene region where the desired activity is encoded. 

Additionally, the v-encoding nucleic acid sequence can be mutated in vitro or in 
vivo J to create and/or destroy translation, initiation, and/or termination sequences, or to create 
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variations in coding regions and/or form new restriction endonuclease sites or destroy preexisting 
ones, to facilitate further in vitro modification. Such modifications can be made to introduce 
restriction sites and facilitate cloning the H4 gene into an expression vector. Any technique for 
mutagenesis known in the art can be used, including but not limited to, in vitro site-directed 
5 mutagenesis (Hutchinson, C, et al, J. Biol. Chem. 253:6551, 1978; Zoller and Smith, DNA 
3:479-488, 1984; Oliphant et al. Gene 44:177, 1986; Hutchinson et aL, Proc, Natl. Acad. Sci, 
U.S.A. 83:710, 1986), use of TAB" linkers (Pharmacia), etc. PGR techniques are preferred for 
site directed mutagenesis (see Higuchi, 1989, "Using PGR to Engineer DNA", in PGR 
Technology: Principles and Applications for DNA Amplification, H. Erlich, ed., Stockton Press, 

10 Ghapter 6, pp. 61-70). 

The identified and isolated gene can then be inserted into an appropriate cloning 
vector. A large number of vector-host systems known in the art may be used. Possible vectors 
include, but are not limited to, plasmids or modified viruses, but the vector system must be 
compatible with the host cell used. Examples of vectors include, but are not limited to, E. coli, 

15 bacteriophages such as lambda derivatives, or plasmids such as Bluescript, pBR322 derivatives 
or pUC plasmid derivatives, e.g., pGEX vectors, pmal-c, pFLAG, etc. The insertion into a 
cloning vector can, for example, be accomplished by ligating the DNA fi-agment into a cloning 
vector which has complementary cohesive termiru. However, if the complementary restriction 
sites used to fragment the DNA are not present in the cloning vector, the ends of the DNA 

20 molecules may be enzymatically modified. Alternatively, any site desired may be produced by 
ligating nucleotide sequences (linkers) onto the DNA termini; these ligated linkers may comprise 
specific chemically synthesized oligonucleotides encoding restriction endonuclease recognition 
sequences. In addition, simple PGR or overlapping PGR may be used to insert a fragment into 
a cloning vector. 

25 Recombinant molecules can be introduced into host cells via transformation, 

transfection, infection, electroporation, etc., so that many copies of the gene sequence are 
generated. Preferably, the cloned gene is contained on a shuttle vector plasmid, which provides 
for expansion in a cloning cell, e,g,, E. coli, and facile purification for subsequent insertion into 
an appropriate expression cell line, if such is desired. For example, a shuttle vector, which is a 

30 vector that can replicate in more tihan one type of organism, can be prepared for replication in 
both E. coli and Saccharomyces cerevisiae by linking sequences fi:om an E, coli plasmid with 
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sequences form the yeast 2/^ plasmid. 

Regulatory Nucleic Acids 
Elements of the H4 promoter can be identified by scanning the human genomic 
region upstream of the H4 start site, e.g., by creating deletion mutants and checking for 
5 expression, or with the TRANSFAC algorithm. Sequences up to about 6 kilobases (kb) or more 
upstream from the H4 start site can contain tissue-specific regulatory elements. 

The term "H4 promoter" encompasses artificial promoters. Such promoters can 
be prepared by deleting nonessential intervening sequences fi'om the upstream region of the H4 
promoter, or by joining upstream regulatory elements from tlie H4 promoter with a heterologous 
1 0 minimal promoter, such as the CMV immediate early promoter. 

An H4 promoter can be operably associated with a heterogenous coding sequence, 
e.g., for reporter gene (luciferase and green fluorescent proteins are examples of reporter genes) 
in a construct. This construct will result in expression of the heterologous coding sequence under 
control the H4 promoter, e.g., a reporter gene can be expressed, xmder conditions that under 
1 5 normal conditions cause H4 expression. This construct can be used in screening assays, described 
below, for H4 agonists and antagonists. 

Expression of Polypeptides 
The nucleotide sequence coding for H4, or antigenic fi-agment, derivative or analog 
thereof, or a fiinctionally active derivative, including a chimeric protein, thereof, can be inserted 
20 into an appropriate expression vector, /. e. , a vector which contains the necessary elements for the 
transcription and translation of the inserted protein-coding sequence. Thus, a nucleic acid 
encoding H4 of the invention can be operationally associated vsdth a promoter in an expression 
vector of the invention. Both cDNA and genomic sequences can be cloned and expressed vmder 
control of such regulatory sequences. Such vectors can be used to express functional or 
25 functionally inactivated H4 polypeptides. 

The necessary transcriptional and translational signals can be provided on a 
recombinant expression vector, or they may be supplied by the native gene encoding H4 and/or 
its flanking regions. 

Potential host-vector systems include but are not limited to mammalian cell 
30 systems transfected with expression plasmids or infected with virus (e.g., vaccinia virus, 
adenovims, adeno-associated virus, herpes virus, etc); insect cell systems infected with virus 
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{e.g„ baculovirus); microorganisms such as yeast containing yeast vectors; or bacteria 
transformed with bacteriophage, DNA, plasmid DNA, or cosmid DNA. The expression 
elements of vectors vary in their strengths and specificities. Depending on the host-vector system 
utiUzed, any one of a number of suitable transcription and translation elements may be used. 
5 Expression of H4 protein may be controlled by any promoter/enhancer element 

known in the art, but these regulatory elements must be functional in the host selected for 
expression. Promoters which may be used to control H4 gene expression include, but are not 
limited to, cytomegalovirus (CMV) promoter (U.S. Patent Nos. 5,385,839 and 5,168,062), the 
SV40 early promoter region (Benoist and Chambon, 1981, Nature 290:304-310), the promoter 

10 contained in the 3' long terminal repeat of Rous sarcoma virus (Yamamoto, et aL, Cell 22:787- 
797, 1980), the herpes thymidine kinase promoter (Wagner et aL, Proc. Natl. Acad. Sci. U.SA. 
78:1441-1445, 1981), the regulatory sequences of the metallothionein geiie (Brinster et al. 
Nature 296:39-42, 1982); prokaryotic expression vectors such as the p-lactamase promoter 
(Villa-Komarofif, etal, Proc. Natl. Acad. Sci. U.SA. 75:3727-3731, 1978), or the tac promoter 

15 (DeBoer, et al, Proc. Natl. Acad. Sci. U.S.A. 80:21-25, 1983); see also "Useful proteins from 
recombinant bacteria" in Scientific American, 242:74-94, 1980; promoter elements from yeast 
or other fungi such as the Gal 4 promoter, the ADC (alcohol dehydrogenase) promoter, PGK 
(phosphoglycerol kinase) promoter, alkaline phosphatase promoter; and transcriptional control 
regions that exhibit tissue specificity, particularly endothelial cell-specific promoters. 

20 Solubilized forms of the protein can be obtained by solubilizing inclusion bodies 

or reconstituting membrane components, e.g., by treatment with detergent, and if desired 
sonication or other mechanical processes, as described above. The solubiUzed protein can be 
isolated using various techniques, such as polyacrylamide gel electrophoresis (PAGE), isoelectric 
focusing, 2-dimensional gel electrophoresis, chromatography (e.g., ion exchange, affinity, 

25 inmiunoaffinity, and sizing colunrn chromatography), centrifugation, differential solubility, 
immunoprecipitation, or by any other standard technique for the purification of proteins. 

Vectors 

A wide variety of host/expression vector combinations may be employed in 
expressing the DNA sequences of this invention. Useful expression vectors, for example, may 
30 consist of segments of chromosomal, non-chromosomal and synthetic DNA sequences. Suitable 
vectors include derivatives of SV40 and known bacterial plasnodds, e.g., E. coli plasmids col El, 
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pCRl, pBR322, pMal-C2, pET, pGEX (Smith et al. Gene 67:31-40, 1988), pMB9 and their 
derivatives, plasmids such as RP4; phage DNAS, e.g., the numerous derivatives of phage I, e.g,, 
NM989, and other phage DNA, e.g.. Ml 3 and filamentous single stranded phage DNA; yeast 
plasmids such as the 2/^ plasmid or derivatives thereof; vectors useful in eukaryotic cells, such 
as vectors useful in insect or mammalian cells; vectors derived from combinations of plasmids 
and phage DNAs, such as plasmids that have been modified to employ phage DNA or other 
expression control sequences; and the like. 

Viral vectors, such as lentiviruses, retroviruses, herpes viruses, adenoviruses, 
adeno-associated viruses, vaccinia virus, baculovirus, alphavirus, and other recombinant viruses 
with desirable cellular tropism are also useful. Thus, a gene encoding a functional or mutant H4 
protein or polypeptide domain fragment thereof can be introduced in v/vo, ex vivo, or in vitro 
using a viral vector or through direct introduction of DNA. Expression in targeted tissues can 
be effected by targeting the transgenic vector to specific cells, such as with a viral vector or a 
receptor ligand, or by using a tissue-specific promoter, or both. Targeted gene delivery is 
described in Intemational Patent Publication WO 95/28494, published October 1 995, 

Viral vectors commonly used for in vivo or ex vivo targeting and therapy 
procedures are DNA-based vectors and retroviral vectors. Methods for constructing and using 
viral vectors are known in the art (see, e.g.. Miller and Rosman, BioTechniques 1992, 7:980- 
990). Preferably, the viral vectors are replication defective, that is, they are unable to replicate 
autonomously in the target cell. In general, the genome of the replication defective viral vectors 
which are used within the scope of the present invention lack at least one region which is 
necessary for the replication of the virus in the infected cell. These regions can either be 
eliminated (in whole or in part) or be rendered non-functional by any technique known to a 
person skilled in the art. These techniques include the total removal, substitution (by other 
sequences, in particular by the inserted nucleic acid), partial deletion or addition of one or more 
bases to an essential (for replication) region. Such techniques may be performed in vitro (on the 
isolated DNA) or in situ, using the techniques of genetic manipulation or by treatment with 
mutagenic agents. Preferably, the replication defective virus retains the sequences of its genome 
which are necessary for encapsidating the viral particles. 

DNA viral vectors include an attenuated or defective DNA virus, such as but not 
limited to herpes simplex virus (HS V), papillomavirus, Epstein Barr vims (EBV), adenovirus. 
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adeno-associated virus (AAV), and the like. Defective viruses, which entirely or almost entirely 
lack viral genes, are preferred. Defective virus is not infective after introduction into a cell. Use 
of defective viral vectors allows for administration to cells in a specific, localized area, without 
concern that the vector can infect other cells. Thus, a specific tissue can be specifically targeted. 

5 Examples of particular vectors include, but are not limited to, a defective herpes virus 1 (HS V 1 ) 
vector (Kaplitt et al, Molec. Cell. Neurosci. 2:320-330, 1991), defective herpes virus vector 
lacking a glyco-protein L gene (Patent Publication RD 3 7 1 005 A), or other defective herpes virus 
vectors (International Patent Publication No. WO 94/21807, published September 29, 1994; 
International Patent Publication No. WO 92/05263, pubhshed April 2, 1994); an attenuated 

10 adenovirus vector, such as the vector described by Stratford-Perricaudet et al (J. Clin. Invest. 
90:626-630, 1992; see also La Salle etal. Science 259:988-990, 1993); and a defective adeno- 
associated virus vector (Samulski etal, J. ViroL 61:3096-3101, 1987; Samulski etalj. Virol. 
63:3822-3828, 1989; Lebkowski et al, MoL Cell. Biol. 8:3988-3996, 1988). 

Various companies produce viral vectors commercially, including but by no 

15 means limited to Avigen, Inc. (Alameda, CA; AAV vectors). Cell Genesys (Foster City, CA; 
retroviral, adenoviral, AAV vectors, £md lentiviral vectors), Clontech (retroviral and baculoviral 
vectors), Genovo, Inc. (Sharon Hill, PA; adenoviral and AAV vectors), Genvec (adenoviral 
vectors), IntroGene (Leiden, Netherlands; adenoviral vectors), Molecular Medicine (retroviral, 
adenoviral, AAV, and herpes viral vectors), Norgen (adenoviral vectors), Oxford BioMedica 

20 (Oxford, United Kingdom; lentiviral vectors), and Transgene (Strasbourg, France; adenoviral, 
vaccinia, retroviral, and lentiviral vectors). 

Preferably, for in vivo administration, an appropriate immunosuppressive 
treatment is employed in conjunction with the viral vector, e.g., adenovirus vector, to avoid 
inununo-deactivation of the viral vector and transfected cells. For example, immunosuppressive 

25 cytokines, such as interleukin-12 (IL-12), interferon-y (IFN-y), or anti-CD4 antibody, can be 
provided to block humoral or cellular immune responses to the viral vectors (see, e.g., Wilson, 
Nature Medicine, 1995). In that regard, it is advantageous to employ a viral vector that is 
engineered to express a minimal number of antigens. 

In another embodiment, the vector can be introduced in vivo by lipofection, as 

30 naked DNA, or with other transfection facilitating agents (peptides, polymers, etc.). Synthetic 
cationic lipids can be used to prepare liposomes for in vivo transfection of a gene encoding a 
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marker (Feigner, et. al.,Proc.Natl. Acad. Sci. U.S.A. 84:7413-7417, 1987; Feigner and Ringold, 
Science 337:387-388, 1989; see Mackey, et al, Proc. Natl. Acad. Sci. U.S.A. 85:8027-8031, 
1 988; Ulmer, et al , Science 259: 1 745-1 748, 1 993). Useful lipid compounds and compositions 
for transfer of nucleic acids are described in International Patent Publications WO 95/1 8863 and 
WO 96/17823, and in U.S. Patent No. 5,459,127. Lipids may be chemically coupled to other 
molecules for the purpose of targeting (see Mackey, et al^ supra). Targeted peptides, e.g., 
hormones or neurotransmitters, and proteins such as antibodies, or non-peptide molecules could 
be coupled to liposomes chemically. 

Other molecules are also useful for facilitating transfection of a nucleic acid in 
v/vo, such as a cationic oligopeptide (e.g., Intemational Patent Publication WO 95/21931), 
peptides derived from DNA binding proteins {e.g., Intemational Patent Publication WO 
96/25508), or a cationic polymer (e.g., Intemational Patent Publication W095/21931). 

Altematively, non-viral DNA vectors for gene therapy can be introduced into the 
desired host cells by methods known in the art, e.g., electroporation, microinjection, cell fusion, 
DEAE dextran, calcium phosphate precipitation, use of a gene gun (ballistic transfection; see, 
e.g., U.S. Pat. No. 5,204,253, U.S. Pat. No. 5,853,663, U.S. Pat. No. 5,885,795, andU.S. Pat. No. 
5,702,384 and see Sanford, TIB-TECH, 6:299-302, 1988; Fynan et al, Proc. Natl. Acad. Sci. 
U.S.A., 90:1 1478-1 1482, 1993; and Yang et al, Proc. Natl. Acad. Sci. U.S.A., 87:1568-9572, 
1990), or use of a DNA vector transporter (see, e.g., Wu, et al, J. Biol. Chem. 267:963-967, 
1992; Wu and Wu, J. Biol. Chem. 263:14621-14624, 1988; Hartmut, et al, Canadian Patent 
Application No. 2,012,311, filed March 15, 1990; Williams, etal, Proc. Natl. Acad. Sci. USA 
88:2726-2730, 1991). Receptor-mediated DNA delivery approaches can also be used (Curiel, 
etal. Hum. Gene Then 3:147-154, 1992; Wu and Wu, J. Biol. Chem. 262:4429-4432, 1987). 
U.S. Patent Nos. 5,580,859 and 5,589,466 disclose delivery of exogenous DNA sequences, free 
of transfection facilitating agents, in a mammal. Recently, a relatively low voltage, liigh 
efficiency in vivo DNA transfer technique, termed electrotransfer, has been described (Mir, et al , 
CP. Acad. Sci., 321:893, 1998; WO 99/01157; WO 99/01158; WO 99/01175). 



Ligamids and Binding Fartmiers 
The present invention further permits identification of physiological ligands and 
binding partners of H4. One method for evaluating and identifying H4 binding partners is the 
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yeast two-hybrid screen. Preferably, the yeast two-hybrid screen is performed using an cell 
library with yeast that are transformed with recombinant H4. Alternatively, H4 can be used as a 
capture or affinity purification reagent. In another altemative, labeled H4 can be used as a probe 
for binding, e.g., by immunoprecipitation or Western analysis. Expected H4 bmding partners are 
G-proteins. 

Generally, binding interactions between H4 and any of its binding partners will 
be strongest under conditions approximating those found in the cytoplasm, i.e., physiological 
conditions of ionic strength, pH and temperature. Perturbation of these conditions will tend to 
disrupt the stability of a binding interaction. 

Antibodies to 

Antibodies to H4 are usefiil, inter alia, for diagnostics and intracellular regulation 
of H4 activity, as set forth below. According to the invention, a H4 polypeptide produced 
recombinantly or by chemical synthesis, and fragnients or other derivatives or analogs thereof, 
including fusion proteins, may be vised as immunogens to generate antibodies that recognize the 
H4 polypeptide. Such antibodies include but are not limited to polyclonal, monoclonal, chimeric, 
single chain. Fab fragments, and an Fab expression library. Such an antibody is preferfably 
specific for himian H4 and it may recognize either a mutant form of H4 or wild-type H4, or both. 

One can use the hydropathic index of amino acids, as discussed by Kyte and 
Doolittle(JMolBiol. 1982, 157:105-132). See,for example, U.S. Patent 4,554,101, which states 
that the greatest local average hydrophilicity of a "protein," as governed by the hydrophilicity of 
its adjacent amino acids, correlates with its immunogenicity. Accordingly, it is noted that 
substitutions can be made based on the hydrophilicity assigned to each amino acid. In using 
either the hydrophilicity index or hydropathic index, which assigns values to each amino acid, 
it is preferred to introduce substitutions of amino acids where these values are ± 2, with ± 1 being 
particularly preferred, and those within ± 0.5 being tiie most preferred substitutions. 

Various procedures knovm in the art may be used for the production of polyclonal 
antibodies to H4 polypeptide or derivative or analog thereof. For the production of antibody, 
various host animals can be immunized by injection with the H4 polypeptide, or a derivative (e.g., 
firagment or fusion protein) thereof, including but not limited to rabbits, mice, rats, sheep, goats, 
etc. In one embodiment, the H4 polypeptide or firagment thereof can be conjugated to an 
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immunogenic carrier, e.g,, bovine serum albumin (BSA) or keyhole limpet hemocyanin (KLH). 
Various adjuvants may be used to increase the immunological response, depending on the host 
species, including but not limited to Freund*s (complete and incomplete), mineral gels such as 
aluminum hydroxide, surface active substances such as lysolecithin, pluronic polyols, polyanions, 
peptides, oil emulsions, keyhole limpet hemocyanins, dinitrophenol, and potentially useful 
human adjuvants such as BCG (bacille Calmette-Guerin) and Corynebacterium parviun. 

For preparation of monoclonal antibodies directed toward the H4 polypeptide, or 
fragment, analog, or derivative thereof, any technique that provides for the production of 
antibody molecules by continuous cell lines in culture may be used. These include but are not 
limited to the hybridoma technique originally developed by Kohler and Milstein (Nature 1975, 
256:495-497), as well as the trioma technique, the human B-cell hybridoma technique (Kozbor 
etaL, Immunology Today 1983, 4:72; Cote etaL, Proc. Natl. Acad. Sci. 1983, 80:2026-2030), 
and the EBV-hybridoma technique to produce human monoclonal antibodies (Cole et aL^ in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., 1985, pp. 77-96). In an 
additional embodiment of the invention, monoclonal antibodies can be produced in germ-free 
animals (Intemational Patent Publication No. WO 89/12690). In fact, according to the invention, 
techniques developed for the production of "chimeric antibodies" (Morrison et aL, J. Bacteriol. 
1984, 159:870; Neuberger et aL, Nature 1984, 312:604-608; Takeda et aL, Nature 1985, 
314:452-454) by splicing the genes from a mouse antibody molecule specific for an H4 
polypeptide together with genes from a human antibody molecule of appropriate biological 
activity can be used; such antibodies are within the scope of this invention. Such human or 
hiunanized chimeric antibodies are preferred for use in therapy of himnian diseases or disorders 
(described infi'a\ since the human or humanized antibodies are much less likely than xenogenic 
antibodies to induce an immme response, in particular an allergic response, themselves. 

Antibody fragments which contain the idiotype of the antibody molecule can be 
generated by known techniques. For example, such fragments include but are not limited to: the 
F(ab')2 fragment which can be produced by pepsin digestion of the antibody molecule; the Fab* 
fragments which can be generated by reducing the disulfide bridges of the F(ab*)2 fragment, and 
the Fab fragments which can be generated by treating the antibody molecule with papain and a 
reducing agent. 

According to the invention, techniques described for the production of single 



I 
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chain antibodies (U.S. Patent Nos. 5,476,786, 5,132,405, and U.S. Patent 4,946,778) can be 
adapted to produce H4 polypeptide-specific single chain antibodies. An additional embodiment 
of the invention utilizes the techniques described for the construction of Fab expression libraries 
(Huse et al , Science 1 989, 246 : 1 275- 1 28 1 ) to allow rapid and easy identification of monoclonal 
5 Fab fragments with the desired specificity for an H4 polypeptide, or its derivatives, or analogs. 

In the production and use of antibodies, screening for or testing with the desired 
antibody can be accomplished by techniques known in the art, e.g., radioimmiinoassay, ELISA 
(enzyme-linked immunosorbant assay), "sandwich" immunoassays, immimoradiometric assays, 
gel diffusion precipitin reactions, immunodiffusion assays, in situ immimoassays (using colloidal 

10 gold, enzyme or radioisotope labels, for example), Westem blots, precipitation reactions, 
agglutination assays {e.g„ gel agglutination assays, hemagglutination assays), complement 
fixation assays, immunofluorescence assays, protein A assays, and Immunoelectrophoresis 
assays, etc. In one embodiment, antibody binding is detected by detecting a label on the primary 
antibody. In another embodiment, the primary antibody is detected by detecting bindiiig of a 

1 5 secondary antibody or reagent to the primary antibody. In a further embodiment, the secondary 
antibody is labeled. Many means are known in the art for detecting binding in an immunoassay 
and are within the scope of the present invention. For example, to select antibodies which 
recognize a specific epitope of an H4 polypeptide, one may assay generated hybridomas for a. 
product which binds to an H4 polypeptide fragment containing such epitope. For selection of an 

20 antibody specific to an H4 polypeptide from a particular species of animal, one can select on the 
basis of positive binding with H4 polypeptide expressed by or isolated from cells of that species 
of animal. 

The foregoing antibodies can be used in methods known in the art relating to the 
localization and activity of the H4 polypeptide, e.g„ for Westem blotting, imaging H4 polypeptide 

25 in situ, measuring levels thereof in appropriate physiological samples, etc. using any of the 
detection techniques mentioned above or known in the art. Such antibodies can also be used in 
assays for ligand binding, e.g., as described in U.S. Patent No. 5,679,582. Antibody binding 
generally occurs most readily under physiological conditions, e.g., pH of between about 7 and 
8, and physiological ionic strength. The presence of a carrier protein in the buffer solutions 

30 stabilizes the assays. While there is some tolerance ofperturbation of optimal conditions, e.g., 
increasing or decreasing ionic strength, temperature, or pH, or adding detergents or chaotropic 
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salts, such perturbations will decrease binding stability. 

In a specific embodiment, antibodies that act as ligands and agonize or antagonize 
the activity of H4 polypeptide can be generated. In addition, intracellular single chain Fv 
antibodies can be used to regulate cAMP fomiation (Marasco et al., Proc. Natl. Acad. Sci. U.S.A. 
5 1993, 90:7884-7893; Chen., Mol. Med. Today 1997, 3:160-167; Spitz et al.. Anticancer Res. 
1996, 16:3415-22; Indolfi et al., Nat. Med. 1996, 2:634-635; Kijma et al., Pharmacol. Ther. 
1 995, 68:247-267). Such antibodies can be tested using the assays described infra for identifying 
ligands. 



10 Screening and Chemistry 

According to the present invention, nucleotide sequences encoding H4 and the H4 
receptor structure, which can be modeled from the amino acid sequence based on homology to 
other GPCR proteins, are useful targets to identify dmgs that are eflfective in treating disorders 
associated with histamine-regulated processes. Drug targets include without limitation (i) 

1 5 isolated nucleic acids derived from the gene encoding H4 (e.g., antisense or ribozyme molecules) 
and (ii)small molecule compounds that recognize and bind the receptor. 

In particular, identification and isolation of H4 provides for development of 
screening assays, particularly for high throughput screening of molecules that up- or down- 
regulate the activity of H4. Accordingly, the present invention contemplates methods for 

20 identifying specific histamine receptor ligands that interact with H4 receptors, using various 
screening assays known in the art. 

Any screening technique known in the art can be used to screen for H4 agonists 
or antagonists. The present invention contemplates screens for small moleciile ligands or ligand 
analogs and mimics, as well as screens for natural ligands that bind to and agonize or antagonize 

25 H4 activity in vivo. For example, natural products libraries can be screened using assays of the 
invention for molecules that agonize or antagonize H4 expression or activity. 

Another approach uses recombinant bacteriophage to produce large libraries. 
Using the "phage method" (Scott and Smith, Science 1990, 249:386-390; Cwirla, et al, Proc. 
Natl. Acad. Sci,, USA 1990, 87:6378-6382; Devlin er^/.. Science 1990, 49:404-406), very large 

30 libraries can be constructed (106-108 chemical entities). A second approach uses primarily 
chemical methods, of which the Geysen method (Geysen et al. Molecular Immunology 1986, 
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23:709-715; GeyseneraZ. J. Immunologic Method 1987 102:259-274; and the method of Fodor 
etal (Science 1991, 251:767-773) are examples. Furka etal (14th Intemational Congress of 
Biochemistry, Volume #5 1988, Abstract FR:013; Furka, Int. J- Peptide Protein Res. 1991, 
37:487-493), Houghton (U.S. Patent No. 4,631,211) andRutter (U.S. Patent No. 5,010,175) 
5 describe methods to produce a mixture of peptides that can be tested as agonists or antagonists. 

In another aspect, synthetic Hbraries (Needeis et aL^ Proc. Natl. Acad, Sci. USA 
1993, 90:10700-4; Ohlmeyer et al, Proc. Natl. Acad. Sci. USA 1993, 90:10922-10926; Lam 
^/^z/., PCT Publication No. WO 92/00252; Kocis etal, PCT Publication No. WO 9428028) and 
the like can be used to screen for ligands that regulate H4 activity. Test compounds are screened 

10 from large libraries of synthetic or natural compounds. Numerous means are currently vised for 
random and directed synthesis of saccharide, peptide, and nucleic acid based compounds. 
Synthetic compound libraries are commercially available from Maybridge Chemical Co. 
(Trevillet, Cornwall, UK), Comgenex (Princeton, NJ), Brandon Associates (Merrimack, NH), 
and Microsource (New Milford, CT). A rare chemical library is available from Aldrich 

1 5 (Milwaukee, WI). Alternatively, libraries of natural compounds in the form of bacterial, fungal, 
plant and animal extracts are available from e,g. Pan Laboratories (Bothell, WA) or MycoSearch 
(NC), or are readily producible. Additionally, natural and synthetically produced libraries and 
compounds are readily modified through conventional chemical, physical, and biochemical 
means (Blondelle et al, Tib Tech 1996, 14:60). 

20 Knowledge of the primary sequence of H4, and the similarity of that sequence with 

proteins of known function, can provide an initial clue as to the structure of agonists or 
antagonists of the receptor. Identification and screening of agonists antagonists is further 
facilitated by determining structural features of the receptor, e.g., using X-ray crystallography, 
neutron diffraction, nuclear magnetic resonance spectrometry, homology studies, structure- 

25 activity relationships, and other techniques for structure determination. These techniques provide 
for the rational design or identification of agonists and antagonists. 

One technique that may be used to assess the affinity of a test compound for the 
H4 receptor is a competition binding assay. In this assay, test wells containing an aliquot of a 
lipid bilayer membranes that contain the histamine H4 receptor are incubated with an known 

3 0 concentration of a radiolabeled ligand for the receptor. The lipid bilayer may be prepared by any 
known protocol that separates the membrane containing receptor component from the 
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cytoplasmic components. Each well also is incubated with a different concentration of a 
unlabeled test compound. Cell membranes are then separated from the incubation mixture by 
any method known in the art including, but not limited to, centrifugation and vacuum filtration 
on a cell harvester. The radioactivity of each well is then determined using any device that can 
detect radioactivity, such as a scintillation counter. As increasing concentrations of the test 
compound compete for the receptor binding site, the radioactivity detected decreases. The data 
then can be converted using the Cheng-Prusoff equation (Biochem Pharmacol. 1973, 
22:3099-3108) to determine the affinity (K-) of the compound for the receptor. 

In vivo screening met/tods 

Intact cells or whole animals expressing a gene encoding H4 can be used in 
screening methods to identify candidate drugs. 

In one series of embodiments, a permanent cell line is established. Alternatively, 
cells (including without limitation mammalian, insect, yeast, or bacterial cells) are transiently 
programmed to express an H4 gene by introduction of appropriate DN A or mRNA. Identification 
of candidate compoimds can be achieved using any suitable assay, including without limitation 
(i) assays that measure binding of test compoxmds to H4 (ii) assays that measure the ability of a 
test compound to modify (z. e. , inhibit or enhance) a measurable activity or function of H4 and (iii) 
assays that measure the ability of a compound to modify (te,, inhibit or enhemce) the 
transcriptional activity of sequences derived from the promoter (z.e., regulatory) regions of the 
H4 gene. 

H4 knockout mammals can be prepared for evaluating the molecular pathology 
of this defect in greater detail than is possible with human subjects. Such animals also provide 
excellent models for screening drag candidates. A "knockout mammal" is an mammal (e.g., 
mouse, rabbit) that contains within its genome a specific gene that has been inactivated by the 
method of gene targeting (see, e.g., U.S. Patent Nos. 5,777,195 and 5,616,491). A knockout 
mammal includes both a heterozygote knockout (i,e., one defective allele and one vsdld-type 
allele) and a homozygous mutant (i.e., two defective alleles; a heterologous construct for 
expression of an H4, such as a human H4, could be inserted to permit the knockout mammal to 
live if lack of H4 expression was lethal). Preparation of a knockout mammal requires first 
introducing a nucleic acid construct that will be used to suppress expression of a particular gene 
into an undifferentiated cell type termed an embryonic stem cell. This cell is then injected into 
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a mammalian embryo. A mammalian embryo with an integrated cell is then implanted into a 
foster mother for the duration of gestation. Zhou, et al (Genes and Development 1995, 
9:2623-34) describes PPCA knock-out mice. 

The term "knockout" refers to partial or complete suppression of the expression 
5 of at least a portion of a protein encoded by an endogenous DNA sequence in a cell. The term 
"knockout construct" refers to a nucleic acid sequence that is designed to decrease or suppress 
expression of a protein encoded by endogenous DNA sequences in a cell. The nucleic acid 
sequence used as the knockout construct is typically comprised of (1) DNA from some portion 
of the gene (exon sequence, intron sequence, and/or promoter sequence) to be suppressed and 

10 (2) a marker sequence used to detect the presence of the knockout construct in the cell. The 
knockout construct is inserted into a cell, and integrates with the genomic DNA of the cell in 
such a position so as to prevent or interrupt transcription of the native DNA sequence. Such 
insertion usually occurs by homologous recombination (i.e., regions of the knockout constmct 
that are homologous to endogenous DNA sequences hybridize to each other when the knockout 

1 5 constmct is inserted into the cell and recombine so that the knockout constmct is incorporated 
into the corresponding position of the endogenous DNA). The knockout constmct nucleic acid 
sequence may comprise (1) a full or partial sequence of one or more exons and/or introns of the 
gene to be suppressed, (2) a full or partial promoter sequence of the gene to be suppressed, or (3) 
combinations thereof. Typically, the knockout construct is inserted into an embryonic stem cell 

20 (ES cell) and is integrated into the ES cell genomic DNA, usually by the process of homologous 
recombination. This ES cell is then injected into, and integrates with, the developing embryo. 

The phrases "disruption of the gene" and "gene disruption" refer to insertion of 
a nucleic acid sequence into one region of the native DNA sequence (usually one or more exons) 

25 and/or the promoter region of a gene so as to decrease or prevent expression of that gene in the 
cell as compared to the wild-type or naturally occurring sequence of the gene. By way of 
example, a nucleic acid constmct can be prepared containing a DNA sequence encoding an 
antibiotic resistance gene which is inserted into the DNA sequence that is complementary to the 
DNA sequence (promoter and/or coding region) to be dismpted. When this nucleic acid constmct 

30 is then transfected into a cell, the constmct will integrate into the genomic DNA. Thus, many 
progeny of the cell will no longer express the gene at least in some cells, or will express it at a 



BNSOOCtD: <WO 0185786A2_I_> 



wo 01/85786 



PCT/lLJSOl/14527 



-35- 

decreased level, as the DNA is now disrupted by the antibiotic resistance gene. 

Generally, the DNA will be at least about 1 kb in length and preferably 3-4 kb in 
length, thereby providing sufficient complementary sequence for recombination when the 
knockout construct is introduced into the genomic DNA of the ES cell (discussed below). 

Included within the scope of this invention is a mammal in which two or more 
genes have been knocked out Such mammals can be generated by repeating the procedures set 
forth herein for generating each knockout construct, or by breeding to mammals, each with a 
single gene knocked out, to each other, and screening for those with the double knockout 
genotype. 

Regulated knockout animals can be prepared using various systems, such as the 
tet-repressor system (see U.S. Patent No. 5,654,168) orthe Cre-Lox system (see U.S. PatentNos. 
4,959,317 and 5,801,030). 

In another series of embodiments, transgenic animals are created in which (i) a 
human H4 is stably inserted into the genome of the transgenic animal; and/or (ii) the endogenous 
H4 genes are inactivated and replaced with hvmian H4 genes. See, e,g,, Coffinan, Semin. Nephrol. 
1997, 17:404; Esther et al. Lab. Invest. 1996, 74:953; Murakami et al.. Blood Press. Suppl. 
1996,2:36. 

H 4 Activation Assay 

Any cell assay system that allows for assessment of functional activity of H4 
agonists and antagonists is defined by the present invention. In a specific embodiment, 
exemplified infra, the assay can be used to identify compoimds tliat selectively interact with H4, 
which can be evaluated by assessing the effects of H4 transformed cells contacted with a test 
compound, which modulates cAMP accxunulation. The assay system can thus be used to identify 
compounds that selectively produce a functional effect through histamine H4 receptors. 
Compoimds that increase cAMP formation and acctimulation may be useful as novel therapeutics 
in the prevention of transplanted organ rej ection, asthma, allergies, and autoimmime pathologies, 
such as, but not limited to, multiple sclerosis, type I diabetes, and rheumatoid arthritis. 
Additionally, CNS functions such as cognitive and memory defects may be treated. Preferably, 
each experiment is performed in triplicate at multiple different dilutions of test compoxmd. 

An agonist and/or antagonist screen involves detecting cAMP accimiulation by 
the host cell when contacted with H4 ligand. If cAMP accumulation is increased, the test 
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compovmd is a candidate antagonist of H4 receptors. If cAMP accumulation is decreased, the test 
compound is a candidate agonist of H4 receptors. If there is no change in cAMP formation, the 
test compound is not an effective H4 ligand. 

Any convenient method permits detection of the formed product, cAMP. For 
example, the invention provides immimoassays for detecting cAMP. Typically, inraiunoassays 
use either a labeled antibody or a labeled antigenic component (e.g.^ that competes with the 
antigen in the sample for binding to the antibody). Suitable labels include without limitation 
enzyme-based, fluorescent, chemiluminescent, radioactive, or dye molecules. Assays that 
amplify the signals from the probe are also known, such as, for example, those that utilize biotin 
and avidin, and enzyme-labelled immunoassays, such as ELISA assays. Alternatively, labeled 
antigenic component may be quantified by scintillation techniques. In another method, the 
second messenger, preferably cAMP, will be separated on a high performance liquid 
chromatograph and quantified by a UV detector. 

The assay system described here also may be used in a high-throughput primary 
screen for agonists and antagonists, or it may be used as a secondary functional screen for 
candidate compoimds identified by a different primary screen, e.g., a binding assay screen that 
identifies compounds that interact with the receptor. 

High-Throughput Screen 

Agents according to the invention may be identified by screening in high- 
throughput assays, including without limitation cell-based or cell-free assays. It will be 
appreciated by those skilled in the art that different types of assays can be used to detect different 
types of agents. Several methods of automated assays have been developed in recent years so 
as to permit screening of tens of thousands of compounds in a short period of time. Such high- 
throughput screening methods are particularly preferred. The use of high-throughput screening 
assays to test for agents is greatly facilitated by the availability of large amoxmts of purified 
polypeptides, as provided by the invention. 

Compounds 

"Histamine" refers to a neurotransmitter that is produced and released from 
neurons. Histamine is formed from the amino acid histidine by histidine decarboxylase. 
Structurally, histamine is an imidazolethylamine. In other words, histamine is comprised of 
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an imidazole moiety and an amino group connected by an alkyl chain. The 
phamiacologically active fomi of histamine is proposed to be the monocationic tautomer, 
where one of the nitrogen present in the imidazole ring is positively charged. However, 
different forms of histamine may interact with histamine receptors to produce a functional 
effect. Histamine is produced intracellularly and stored until released in response to a 
physiological stimulus. 

"Histamine analogs" refers to compounds that comprise the imidazole, alkyl 
chain, and a nitrogen moiety (e.g. , amine (unsubstituted or substituted), piperidine, pyridine) 
of histamine, but may be modified at other positions. These modifications may be performed 
to alter affinity and/or selectivity of the compound for the histamine receptors. "Histamine 
compounds" refers to compoimds that may bind to the histamine receptors. 

Histamine analogs and compovmds can be classified as agonists or antagonists. 
As discussed previously, agonists are ligands that bind to the receptor and produce a 
functional eflFect similar to that produced by the endogenous ligand (Le, , histamine) for the 
receptor, whereas antagonists are ligands that bind to the receptor and block a functional 
effect produced by an agonist for the receptor or the endogenous ligand of the receptor. 
Histamine analogs and compounds are further described in Goodman and Oilman's The 
Pharmacological Basis of Therapeutics, Ninth Edition, McGraw-Hill, 1996, 

Agonists that may be contemplated by this invention include, but are not 
limited to, R-(a)-methylhistamine, imetit, and immepip. Antagonists, burimamide, 
impromidine, dimaprit, and thioperan[ude clobenpropit and iodophenpropit impentamine, 
GT2016 and iodoproxyfan. Other compoimds include derivatives, metabolites, and 
precursors. 

Methods of Diagnosis 
According to the present invention, genetic variants of H4 can be detected to 
diagnose an H4 associated disease, such as treatment for transplanted organ rejection, asthma, 
allergies, and autoimmune pathologies, such as, but not limited to, multiple sclerosis, type I 
diabetes, and rheumatoid arthritis. Additionally, CNS functions such as cognitive and 
memory defects may be treated. The various methods for detecting such variants are 
described herein. Where such variants inipact H4 function, either as a result of a mutated 
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amino acid sequence or because the mutation results in expression of a truncated protein, or 
no expression at all, they are expected to result in disregulation of the allergic response, the 
immune response, cognition and memory. 

5 Nucleic A cid Assays 

The DNA may be obtained from any cell source. DNA is extracted from the 
cell source or body fluid using any of the numerous methods that are standard in the art. It 
will be understood that the particular method used to extract DNA will depend on the nature 
of the source. Generally, the minimum amount of DNA to be extracted for use in the present 
10 invention is about 25 pg (corresponding to about 5 cell equivalents of a genome size of 4 x 
10^ base pairs). 

In another alternate embodiment, RNA is isolated from biopsy tissue using 
standard methods well known to those of ordinary skill in the art such as guanidium 
thiocyanate-phenol-chloroform extraction (Chomocyznski et al , Anal. Biochem., 1 62: 1 56, 

15 1987). The isolated RNA is then subjected to coupled reverse transcription and amplification 
by polymerase chain reaction (RT-PCR), using specific oligonucleotide primers that are 
specific for a selected site. Conditions for primer annealing are chosen to ensure specific 
reverse transcription and amplification; thus, the appearance of an amplification product is 
diagnostic of the presence of a particvdar genetic variation. In another embodiment, RNA is 

20 reverse-transcribed and amplified, after which the amplified sequences are identified by, e.g., 
direct sequencing. In still another embodiment, cDNA obtained from the RNA can be cloned 
and sequenced to identify a mutation. 

Protein Assays 

In an alternate embodiment, biopsy tissue is obtained from a subject. 

25 Antibodies that are capable of specifically binding to H4 are then contacted with samples of 
the tissue to determine the presence or absence of a H4 polypeptide specified by the antibody. 
The antibodies may be polyclonal or monoclonal, preferably monoclonal. Measurement of 
specific antibody binding to cells may be accomplished by any known method, e.g., 
quantitative flow cytometry, en^mie-linked or fluorescence-linked immunoassay. Western 

30 analysis, etc. 



BNSDOCID: <WO_0185786A2_L> 



wo 01/85786 



PCT/USOl/14527 



-39- 

Therapeutic Uses 

According to the present invention, stimulation of H4 receptor activity may be 
used as a treatment option in patients with histamine-related disease states. Stimulation of H4 
receptor activity may be by methods, such as, but not limited to, (i) providing polypeptides that 
stimulate receptor activity and (ii) providing compounds that stimulate receptor activity. 

Gene Therapy 

In a specific embodiment, vectors comprising a sequence encoding a protein, 
including, but not limited to, full-length H4, are provided to treat or prevent a disease or disorder 
associated with the function of H4 in peripheral blood leukocytes. In this embodiment of the 
invention, the therapeutic vector encodes a sequence that produces the protein of the invention. 

Any of the methods for gene therapy available in the art can be used according to 
the present invention. Exemplary methods are described below. 

For general reviews of the methods of gene therapy, see, Goldspiel et al. , Clinical 
Pharmacy, 1993, 12:488-505; Wu and Wu, Biotherapy, 1991, 3:87-95; Tolstoshev, Ann. Rev. 
Pharmacol. Toxicol., 1993, 32:573-596; Mulligan, Science, 1993, 260:926-932; andMorganand 
Anderson, Ann. Rev. Biochem., 1993, 62:191-217; May, TIBTECH, 1993, 11:155-215. Methods 
commonly known in the art of recombinant DNA technology that can be used are described in 
Ausubel et aL, (eds.), 1993, Current Protocols in Molecular Biology, John Wiley & Sons, NY; 
Kriegler, 1 990, Gene Transfer and Expression, A Laboratory Manual, Stockton Press, NY; and 
in Chapters 12 and 13, Dracopoli et al., (eds.), 1 994, Current Protocols in Human Genetics, John 
Wiley & Sons, NY. Vectors suitable for gene therapy are described above. 

In one aspect, the therapeutic vector comprises a nucleic acid that expresses a 
protein of the invention in a suitable host. In particular, such a vector has a promoter 
operationally linked to the coding sequence for the protein. The promoter can be inducible or 
constitutive and, optionally, tissue-specific. In another embodiment, a nucleic acid molecule is 
used in which the protein coding sequences and any other desired sequences are flanked by 
regions that promote homologous recombination at a desired site in the genome, thus providing 
for intrachromosomal expression of the protein (KoUer and Smithies, Proc. Natl. Acad. Sci. 
U.S.A, 1989, 86:8932-8935; Zijlstrae^ a/.. Nature, 1989,342:435-438). 

Delivery of the vector into a patient may be either direct, in which case the patient 
is directly exposed to the vector or a delivery complex, or indirect, in which case, cells are first 
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transformed with the vector in vitro then transplanted into the patient. These two approaches are 
known, respectively, as in vivo and ex vivo gene therapy. 

In a specific embodiment, the vector is directly provided in vivo, where it enters 
the cells of the organism and mediates expression of the protein. This can be accomplished by 
any of numerous methods known in the art, by constructing it as part of an appropriate expression 
vector and administering it so that it becomes intracellular, e,g., by infection using a defective 
or attenuated retroviral or other viral vector (see, U.S. Patent No. 4,980,286), or by direct 
injection of naked DNA, or by use of microparticle bombardment {e.g., a gene gun; Biolistic, 
Dupont); or coating with lipids or cell-surface receptors or transfecting agents, encapsulation in 
biopolymers {eg., poly-S-l-64-N-acetylglucosamine polysaccharide; see, U.S. Patent No. 
5,635,493), encapsulation in liposomes, microparticles, or microcapsules; by administering it in 
linkage to a peptide or other ligand known to enter the nucleus; or by administering it in linkage 
to a ligand subject to receptor-mediated endocytosis (see, e.g., Wu and Wu, J. Biol. Chem., 1 987, 
62:4429-4432), etc. In another embodiment, a nucleic acid ligand complex can be formed in 
which the ligand comprises a fiisogenic viral peptide to disrupt endosomes, allowing the nucleic 
acid to avoid lysosomal degradation. In yet another embodiment, the nucleic acid can be targeted 
in vivo for cell specific uptake and expression, by targeting a specific receptor (see, e.g., PCT 
Publication Nos. WO 92/06180, WO 92/22635, WO 92/20316 and WO 93/14188). 
Alternatively, the nucleic acid can be introduced intracellularly and incorporated within host cell 
DNA for expression by homologous recombination (KoUer and Smithies, Proc. Natl. Acad. Sci, 
USA, 1989, 86:8932-8935; Zijlstra, et al.. Nature, 1989, 342:435-438). These methods are in 
addition to those discussed above in conjunction with "Viral and Non-viral Vectors". 

The form and amount of therapeutic nucleic acid envisioned for use depends on 
the type of disease and the severity of the desired effect, patient state, etc., and can be determined 
by one skilled in the art. 

Inhibition or stimulation of protein synthesis 

Gene transcription and protein translation may be inhibited or stimulated by 
administration of exogenous compounds. Exogenous compounds may interact with 
extracellular and/or intracellular messenger systems, such as, but not limited to, adenosine 
triphosphate, nitric oxide, and guanosine triphosphate; to regulate protein synthesis. In this 
embodiment, exogenous compounds that stimulate or inhibit H4 protein synthesis may be 
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used in the prevention and/or treatment for transplanted organ rejection, asthma, allergies, and 
autoimmune pathologies, such as, but not limited to, multiple sclerosis, type I diabetes, and 
rheumatoid arthritis. Additionally, CNS functions such as cognitive and memory defects may 
be treated. 

The present invention provides antisense nucleic acids (including ribozymes), 
which may be used to inhibit expression of H4 of the invention. The antisense nucleic acid, 
upon hybridizing xmder cytoplasmic conditions with complementary bases in an RNA or 
DNA molecule, inhibits the role of the RNA or DNA. Additionally, hybridization of the 
antisense nucleic acid to the DNA or RNA may inhibit transcription of the DNA into RNA 
and/or translation of the RNA into the protein. If the RNA is a messenger RNA transcript, 
the antisense nucleic acid is a coxmtertranscript or mRNA-interfering complementary nucleic 
acid. Antisense nucleic acid molecules can be encoded by a recombinant gene for expression 
inacell (e.g., U.S. Patent No. 5,814,500; U.S. Patent No. 5,811,234) or can be prepared 
synthetically (e.g., U.S. Patent No. 5,780,607). 

Alternatively, antibody molecules can also be administered, for example, by 
expressing nucleotide sequences encoding single-chain antibodies within the target cell 
population by utilizing, for example, techniques such as those described in Marasco et al. 
(Proc. Natl. Acad Sci. USA, 1993, 90:7889-7893). 

Therapeutically suggested compounds may be provided to the patient in 
formulations that are known in the art and may include any pharmaceutically acceptable 
additives, such as excipents, lubricants, diluents, flavorants, colorants, and disintegrants. The 
formulations may be produced in useful dosage imits such as tablet, caplet, capsule, liquid, or 
injection. 

The form and amount of therapeutic compound envisioned for use depends on the 
type of disease and the severity of the desired effect, patient state, etc., and can be determined by 
one skilled in the art. 

EXAMPLES 

The present invention will be better imderstood by reference to the following 
Examples, which are provided as exemplary of the invention, and not by way of limitation. 
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EXAMPLE I: CHARACTERIZATION OF THE RECEPTOR 

A GPCR sequence profile was generated from a sequence alignment of members 
of this subfamily based on hidden Markov Models (Eddy Bioinformatics, 1998 14:755-763; 
Durbin et al A tutorial introduction to hidden Markov models and other probabilistic modeling 
5 approaches in computational sequence analysis, Cambridge University Press, 1998) to look for 
novel members in the human genome database. In detail, GPCR proteins of the biogenic amine 
subfamily were retrieved from S wiss-Prot database and a sequence alignment was generated by 
a multiple sequence alignment tool, named CLUSTALW (Thompson et ah Nucleic Acids Res, 
1994, 22:4673-80). Using the HMMER program (Eddy, PIMMER User's Guide and Program, 

10 Version 2.1, 1998), a consensus sequence (GNLLVILVIL RTKKLRTPTN IFILNLAVAD 
LLFLLTLPPW ALYYLVGGSE DWPFGSALCK LVTALDVVNM YASILLLTAI 
SIDRYLAIVH PLRYRRRRTS PRRAKVVILL VWVLALLLSL PPLLFSWVKT 
VEEGNGTLNV NVTVCLIDFP EESTAS VSTW LRS YVLLSTL VGFLLPLLVI LVC YTRILRT 
LRKAAKTLLV VVVVFVLCWL PYFIVLLLDT LCLSIIMSST CELERVLPTA 

15 LLVTLWLAYV NSCLNPEY; SEQ ID NO: 3) was developed from the biogenic amine 
subfamily members. The consensus sequence contained the unique 7-transmembrane sequence 
stmcture of biogenic amine GPCRs. 

A weekly update of nucleotide sequence from GenBank database is maintained 
in-house. An auto-search script using TBLASTN program weis written and the biogenic amine 

20 GPCR consensus sequence was used to search this database weekly. Every TBLASTN search 
result was carefully examined and potential open reading frame (ORF) fragments were extracted 
from nucleotide sequence. Each fragment was fiuther verified to determine its novelty. 

Four peptide fragments translated from a recently released human genomic 
sequence (Accession nxmiber: AC007922) of chromosome 18 clone RP11-178F10, from the 

25 Whitehead Institute/MIT Center for Genome Research, were shown to have modest homology 
to different regions of this GPCR consensus sequence. These four fragments located to different 
regions of sequence AC009722 in both plus and minus strands. Interestingly, the highest scoring 
hit from searching the protein database with these peptides by BLASP was in all cases, the 
human histamine receptor 3 (H3). It is very likely that these four fragments are exons of a GPCR 

30 gene and their appearance in both strands of this genomic sequence may result from incorrect 
genomic contig assembly. 
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To clone the full length ORF, multiple primer sets were designed against the 
predicted sequence. Using multiple primer pairs, 3 overlapping pieces spanning the entire ORF 
were obtained jfrom a human heart marathon cDNA library (Clontech, Palo Alto, CA). Using the 
marathon adapter primer API, and a reverse primer corresponding to ORF bases 644-615, a 
5 fragment of the H4 cDNA was obtained which corresponds to bases 276-644 of the H4 coding 
region (exon sequence) and 39 bases of intronic sequence at the 5' end. Primers comprising 
nucleotides 461-482 and 1 173-1 146 were used to PGR a 712bp band. The 5' sequence was 
obtained using primers comprising nucleotides 1-32 and 339-309. The outer primers (1-32, 
1173-1146) were used to piece the 3 fragments together. The sequence was ligated into the 
10 mammalian expression vector pCDNA3.1 + zeo (Invitrogen, Carlsbad, CA). 

EXAMPLE II: TISSUE EXPRESSION OF THE RECEPTOR 

Quantitative RT-PCR was performed on an ABI 7700 "Taqman" sequence 
detection system to determine the tissue distribution of the H4 receptor. Primers spanning the 

1 5 exon 1-2 boundary (Forward primer: 5 -taacttggccattgacttctt-3' (SEQ ID NO:4); Reverse primer: 
5-attcgaacagcatgtgagggat-3' (SEQ ID NO:5) and a Probe: 5 -(6-carboxyfluorescein)- 
tacaaaggaatggagatcacacccaca- (6-carboxy-N,N,N'N'-tetramethylrhodamine)-3' (SEQ IDNO:6)) 
were used to determine H4 expression levels in a mRNA prepared from a series of human tissues, 
purchased from Clontech (Palo Alto). A 2 step reaction procedure was performed as per 

20 manufacturers directions. Briefly, 2mg RNA was reverse transcribed using random hexamers 
(2.5mM in a final volume of 20ml. 14ul of this was used in the PGR reaction. The cyling 
conditions were as follows, 95 °C for 10 minutes, followed by 40 cycles of (a) 95 °C for 15 
seconds and (b) 60 °C for 1 minute. 

25 EXAMPLE M: PHARMACOLOGICAL PROFILE OF THE H. RECEPTOR IN YEAST 

CELLS 

The H4 receptor protein coding sequences were amplified using Forward oligo: 
5'-aaggatccaaaatgccagatactaatagc-3' (SEQ ID NO:7) and Reverse oligo: 5'- 
aagtcgacttaagaagatactgaccgac-3' (SEQ ID NO: 8) that add BamHI and yeast consensus 
30 translational initiation sites to the 5' end and a Sail site to the 3' end. The fragment was cloned 
into corresponding sites in the multicopy yeast expression vector, p426GPD, thus placing 
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receptor expression under control of the strong constitutive GPDl promoter. The yeast 
expression plasmid, pMP327, was introduced into MPY578i5 cells (MATa ura3 his3 trpl leu2 
lys2 ade2 farl::LYS2 fusl::FUSl-fflS3 sst2::SST2-G418R ste2::LEU2, gpal::GPAli5) 
(Hadcock, J.R. and Pausch, M.H. submitted) using LiOAc and selected for ura prototrophy. In 
order to facilitate coupling of the receptor to the G protein, MPY578i5 cells express a chimeric 
G alpha protein coupled to the mating signal transduction pathway. The chimeric construct is 
expressed from the GPAl locus and is composed of Gpal sequences in which the 5'-C-terminal 
amino acids have been replaced with those of Gai3. A multicopy FUSl-LacZ reporter gene 
plasmid, pMP283 (Hadcock, J.R. and Pausch, M.H. submitted), was subsequently introduced into 
H4R-containing MPY578i5 cells and selected on media lacking trp and ura. The resulting yeast 
strain, MPY733, was used for further analysis. 

Samples (250 ng) of compounds present in the LOP AC panel (Sigma RBI, Natick 
MA), were dispensed to 96 well microtiter dishes. MPY733 cells (5 x lOVml, 200 ml/well) in 
assay medium (SCD-ura-trp, pH 6.8, 25 mM PIPES, O.lmg/ml Chlorophenylred 
bglactopyranoside(CPRG), 2 mM 3-AT) were added and cultured overnight at 30 ^^C. The 
presence of active compounds was detected the next day by measurement of absorbance at 570 
nm using a Wallac Victor 11. The LOP AC panel was screened in duplicate and in parallel with 
another yeast strain containing a different orphan GPCR. Only compounds that produced 
significantly elevated absorbance in both receptor containing plates and not in the other GPCR 
containing plates were deemed active. 

EXAMPLE IV: PHARMACOLOGICAL PROFILE OF THE RECEPTOR IN 
MAMMALIAN CELLS 

The ORF was modified by PGR for mammalian expression of H4. A 5' Hindin 
restriciton enzymes site and a Kozak consensus sequence were added using the primer 5 - 
aagcttccaccatgccagatactaatagcacaatcaatttatc-3' (SEQ ID NO:9), and a 3' Xbal site added 5 - 
tctagattaagaagatactgaccgactgtgttg-3 ' (SEQ ID NO: 1 0). The sequence was confirmed and ligated 
into the Hindlll and Xba I sites of the mammalian expression vector pCDNA3.1+ zeo 
(Invitrogen, Carlsbad CA). HEK 293 cells (approximately 10^ cells) were transfected with the 
pCDNA3.1+zeo/H4 using standard lipofectamine plus reagent (Life Technologies). Cells were 
maintained in DMEM containing 1 0% fetal calf serum and penicillin (1 00units/ml)/streptomycin 
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(lOOmg/ml). 48 hours post transfection cells were selected using 500mg/ml zeocin. Zeocin 
resistant clones were assayed using the cAMP assay and by RT-PCR. RNA was extracted from 
approximately 10^ cells using One step PGR kit (Life technologies). The primers used in the 
extraction were 5'-ggaaggatgaaggtagtgaatg-3' (SEQ IDNO:l 1) and 5 -cagaatctgattgggaggaagg-3' 
(SEQE)NO:12). 

HEK cells stably expressing the H4 receptor were assessed functionally in cAMP 
assays using the cAMP scintillation proximity assay (SPA) (Amersham Pharmacia Biotech, 
Pistcataway, NJ). Briefly, 40 000 cells were plated into wells of a 96 well plate. 24 hours later 
the media was removed and replaced with 100|liI Krebs bicarbonate buffer and the cells were 
incubated at 37°C for 15 minutes. Following this, the cells were incubated in Krebs buffer 
containing 0.5mM isobutylmethylxanthine, a phosphodiesterase inhibitor, to prevent cAMP 
breakdown. The effect of H4 on the forskolin induced formation was determined by incubating 
the cells in the presence of forskolin (lOfiM) and agonist for 12 minutes. cAMP levels were 
determined using the cAMP SPA kit according to the manufacturers directions. 

EXAMPLE V: EXPRESSION IN INDUCED CD4 AND CDS CLONES 

This examination was undertaken using the TaqMan EZ RT PGR kit and the 
following oligonucleotides (provided by Philip Jones at Wyeth Neuroscience) (i) H4 EX IF 
(forward oligo): 5'-taacttggccatctttgac-3' (SEQ ID NO:13), (ii) H4 EX IR (reverse primer): 5'- 
attcgaacagcgtgtgag-3' (SEQ ID NO: 14), and H4 EX 1 Probe: 5'-(6-carboxyfluorescein)- 
tacaaaggaatggagatca-3' (SEQ ID NO: 15). RNA for the standard curve was polyA+ human 
leukocyte RNA from Clontech. 

50 nanograms of total RNA was assayed in duplicate from 6 human T cell clones 
derived from a single human donor. Three of these clones were CD4-I-, two were CD8+, and one 
was an NK T cell clone. These T cell clones were stimulated vsath anti-CD3 and RNA was 
isolated at 0, 2, 4, 8, 24 and 48 hours after stimulation. Expression was normalized with 
GAPDH. 

In order to determine whether the pattern of H4 expression was specific to clonal 
populations of human lymphocytes, the same TaqMan quantitative PGR assay on 10 nanograms 
of RNA from bulk populations of CD4+ and CD8+ T cells isolated from three different 
individuals. These lymphocytes were stimulated with anti-CD3 and harvested at the same time 
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points listed above. H4 expression was nonnalized with GAPDH. 

RESITLTS 

Using a GPCR sequence profile generated from a Hidden Markov Model (HMM) 
5 of the biogenic amine subfamily, several sequences with homology to the human H3 receptor 
were identified fi*om the hviman genomic sequence of chromosome 18 (clone RPll- 178F10 
Accession number: AC007922 fi"om the Whitehead Institute/MIT Center for Genome Research) 
(Eddy Bioinformatics, 1998, 14:755-763). A contig of the predicted exons forms a sequence 
encoding aputative GPCR whose nucleotide sequence and translated peptide sequence are shown 

10 in Figure 1 and SEQ ID NOS: 1 and 2. The sequence was deposited with Genbank (Accession 
Number AF307973). 

The predicted protein sequence contains residues that are conserved across the 
biogenic amine receptors. As in other biogenic amine receptors, the present protein contains 
conserved aspartate residues within transmembrane (TM) domains 2 and 3 (at positions 61 and 

15 94). A DY (D94 and Y95) motif in TM3 is found only in histamine and muscarinic receptors. 
A DRY sequence motif that exists at the cytoplasmic interface of TM3 of all GPCRs also can be 
found in the H4 sequence (amino acids 111- 113). Also present are two conserved cysteine 
residues in the first and second extracellular loops (position 88 and 164), which are predicted 
to form a disulfide bond; two conserved tryptophan residues in TM4 and TM6 (positions 140 and 

20 316); and conserved proline residues in TM5, TM6, and TM7 (positions 186, 318, and 355, 
respectively). Based on these results, H4 cDNA appears to encode a biogenic amine-like 
receptor. 

The predicted open reading fi:ame is 1 1 73 base pairs long and encodes a protein 
of 390 amino acids. Sequence comparison using BLASTP, under standard conditions, analysis 
25 reveals that the novel protein is most similar to the human H3 receptor (44% identical and 51% 
similar). Based on sequence homology it is proposed that the receptor belongs to the histamine 
receptor family, therefore we have termed it the H4 receptor. 

Receptor distribution studies indicate that the present receptor is highly expressed 
in peripheral blood leukocytes. Trace amounts of the H4 receptor are expressed in heart, lung and 
30 placenta. It is proposed that these trace amounts of expression represent blood cell mRNA 
present in the samples. 
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Functional studies further confirmed the categorization of the protein as a novel 
histamine receptor. Histamine and the H3 selective agonist R-a-methylhistamine stimulated 
cAMP accumulation, suggesting these compounds are agonists at the H4 receptor (See Figure 
2A). R-a-methylhistamine exhibited both lower potency and efficacy than histamine at the H4 
receptor despite being more potent at the related cloned H3 receptor and in several tissue based 
assays for the H3 receptor (VoUinga, etal, J. Med. Chem. 1995, 38:266-271; Harper, etal, Br. 
J. Pharmacol. 1999, 128:751-759). The selective H3 receptor antagonist clobenpropit was a 
partial agonist at the H4 receptor (See Figure 2A). 

The H3 antagonist thioperamide almost fully inhibited the stimulatory response 
produced by histamine (See Figure 2B). Comparatively, clobenpropit partially blocked 
histamine-induced stimulation of cAMP formation, further suggesting that clobenpropit is a 
partial agonist at the H4 receptor. The interaction of the H4 receptor to a GPAI/ Gai3 chimeric 
G-protein alpha subunit also predicts its coupling specificity in mammalian cells. 

Expression of the H4 receptor in HEK 293tsa cells confirms the coupling of the 
H4 receptor to the inhibition of cAMP formation (See Figure 2C). In these cells, forskolin 
stimulated 8-fold greater cAMP formation compared to basal levels. Addition of histamine (1 
\xM) inhibited forskolin-induced stimulation of cAMP accxunulation by about 40%. These 
studies suggest that the H4 receptor couples an inhibitory G-protein to inhibit adenylyl cyclase 
activity and cAMP accumulation. 

Structure-activity relationship studies were conducted with several histamine 
antagonists to further define the pharmacological profile of this receptor. Studies were conducted 
to correlate the effect of the alkyl chain to antagonist activity at the histantiine H4 receptor (See 
Figure 3). The rank order of efficacy obtained for the htmian H4 was (number represents the 
length of the alkyl chain) 5>6= 1 0>8=4. This rank order of efficacy contrasts with the human 
H3 receptor where the rank order is 5>4>6>8>10. 

Prior studies have indicated the expression of at least two subtypes of the H3 
receptor. These two potential H3 subtypes, localized in the rat brain and guinea pig j ej enum, have 
been shown to have rank order of efficiencies of 4=5>3>6>8 and 5>6==4>8>3, respectively. 
These assays were different (rat was a radioligand binding assay and guinea pig was an organ 
bath experiment using isolated guinea pig ileum). However, the relative potencies of the series 
can be compared, so its possible to say that as the rank order differs then the receptors are likely 



wo 01/85786 



PCTAJSOl/14527 



-48- 

to be different (Leurs etal J. Pharm. Exp. Ther./l996, 276:1009-1015.) 

The present studies also indicate that thioperamide is more efficaciovis and potent 
than impentamine at the H4 receptor. Comparatively, impentamine has greater affinity for cloned 
histamine H3 receptors (4-fold) than thioperamide (Ki=50,8nM and 193nM respectively). 

H4 expression was consistently detected in all three CD4+ clones at most time 
points (Figure 4). There was no consistent temporal pattern to anti-CD3 induction of expression 
between these three clones. One CD8+ clone showed a very low but detectable level of 
expression at most time points, while the other as well as the MKT clone had no detectable H4 
expression despite good GAPDH amplification. 

H4 expression, normalized with GAPDH, was detectable in all samples at all time 
points (Figure 5). Anti-CD3 stimulation produced significant induction of H4 expression in all 
6 samples, most peaking at the 8 hour time point. The highest level of induction was foomd in 
the 8 hour CD4+ and CD8+ samples fi:om the same donor (designated 'U'), peaking at greater 
than 100 copies of H4 RNA per copy of GAPDH. 

Combined, these studies indicate that the pharmacological profile of the H4 
receptor is not similar to any known histamine H3 receptor. 

* * H« 

The present invention is not to be limited in scope by the specific embodiments 
described herein. Indeed, various modifications of the invention in addition to those described 
herein will become apparent to those skilled in the art fi:om the foregoing description and the 
accompanying figures. Such modifications are intended to fall within the scope of the appended 
claims. 

It is fiirther to be understood that values are approximate, and are provided for 

description. 

Patents, patent applications, publications, procedures, and the like are cited 
throughout this application, the disclosures of which are incorporated herein by reference in their 
entireties. 
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1 1. An isolated H4 receptor protein having an amino acid sequence at least 

2 51% identical or comprising at least 10 contiguous amino acids from the sequence depicted in 

3 SEQ ID NO:2 which H4 receptor protein binds ligands comprising an imidazole and an amine, 

4 which imidazole and amine are attached by an alkyl chain, wherein the rank order of efficacy of 

5 modulation second messenger formation of the ligands at the H4 receptor protein is 5>6= 1 0>8=4, 

6 where the number represents the number of carbons in the alkyl chain. 

1 2. The H4 receptor protein of claim 1 , wherein upon binding histamine or a 

2 histamine agonist the receptor protein inhibits second messenger formation. 

1 3. The H4 receptor protein of claim 2, wherein the second messenger is 

2 cAMP. 

1 4. The H4 receptor protein of claim 1 which is a human H4 receptor protein. 

1 5. The H4 receptor protein of claim 4 which has an amino acid sequence as 

2 depicted in SEQ ID NO: 2. 

1 6. The H4 receptor protein of claim 4 which is encoded by a nucleic acid 

2 having a sequence as depicted in SEQ ID NO: 1 . 

1 7. An isolated H4 receptor protein having an amino acid sequence with at 

2 least 95% sequence identity to human H4 receptor protein having an amino acid sequence as 

3 depicted in SEQ ID NO: 2. 

I 8 . An isolated nucleic acid encoding the H4 receptor protein of claim 1 or 7. 

1 9. The nucleic acid of claim 8 which is a cDNA. 

BNSDCX^ID: <WO Q1BS7BaA2 I > 
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1 10. The nucleic acid of claim 8, wherein the H4 receptor protein is a human 

2 H4 receptor protein. 

1 11. An isolated nucleic acid encoding an H4 receptor protein, which nucleic 

2 acid hybridizes under stringent conditions to a nucleic acid having a sequence of at least 20 

3 nucleotides identical to a corresponding nucleotide sequence of the same number of bases in SEQ 

4 ID NO: 1 or its complement, 

1 12. The nucleic acid of claim 8 , which encodes an H4 receptor protein having 

2 an amino acid sequence as depicted in SEQ ID NO:2. 

1 13. The nucleic acid of claim 12, which comprises a nucleotide sequence as 

2 depicted in SEQ ID NO: L 

1 1 4 . A vector comprising the nucleic acid of claim 8 operably associated with 

2 an expression control sequence. 

1 1 5. A host cell transfected with the vector of claim 14. 

1 16. Anon-humananimaltransformedwiththe vector of claim 14, wherein the 

2 animal expresses a H4 receptor protein at a detectable level, whereby the cells expressing the H4 

3 receptor protein suppress cAMP formation when contacted with an H4 receptor agonist. 

1 1 7. A method for producing a H4 receptor protein, which method comprises 

2 culturing host cells of claim 15 under conditions that provide for expression of the H4 receptor 

3 protein by the vector. 

1 18. An isolated nucleic acid of at least ten bases having a nucleotide sequence 

2 identical to a corresponding nucleotide sequence of the same number of bases in SEQ ID NO: 

3 1 or its complement. 
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1 19. The nucleic acid of claim 1 8 which is detectably labeled. 

1 20. An antibody that specifically binds to the H4 receptor protein of claim 1 

2 or 7. 

1 21 . A method for detecting an H4 receptor protein, which method comprises 

2 detecting binding of the antibody of claim 20 to a protein in a sample suspected of containing a 

3 H4 receptor protein, wherein the antibody is contacted with the sample under conditions that 

4 permit specific binding with any H4 receptor protein present in the sample, 

1 22. A method for detecting expression of H4 receptor, which method 

2 comprises detecting mRNA encoding H4 receptor in a sample jfrom a cell suspected of expressing 
H4 receptor. 

1 23 . The method according to claim 22 wherein mKNA encoding H4 receptor 

2 is detected by hybridization to a H4 receptor-specific nucleic acid. 

1 24. The method according to claim 23 wherein the H4 receptor-specific nucleic 

2 acid is at least 1 0 nucleotides in length and has a sequence identical to a sequence of the same 

3 number of bases in SEQ ID NO: 1, or the complementary sequence thereof. 

1 25. An assay system for identifying H4 receptor ligands, comprising a 

2 sufficient number of cells of claim 1 5 to detect an alteration in second messenger accumulation. 

1 26. The assay system of claim 25, wherein the second messenger is cAMP. 

1 27. The assay system of claim 25, wherein the receptor is a human receptor. 

1 28, A method for identifying a test compound that antagonizes histamine H4 

2 receptors, which method comprises detecting an increase in the level of a second messenger in 

3 an assay system of claim 25 contacted with the test compound, wherein an increase in the level 

4 of the second messenger indicates that the test compound ant£^onizes the H4 receptor. 



i 
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1 29. A method for identifying a test compound that agonizes histamine H4 

2 receptors, which method comprises detecting a decrease in the level of a second messenger in 

3 an assay system of claim 25 contacted with the test compound, wherein the decrease in the level 

4 of the second messenger indicates that the test compound agonizes the H4 receptor. 

1 30. A method for identifying a compound that binds an H4 receptor, which 

2 method comprises detecting binding of a test compound to the H4 receptor protein of claim 1. 

1 31, The method according to claim 29, wherein binding of the test compound 

2 is detecting by inhibiting binding of a labeled H4 ligand. 

1 32. The method according to claim 29, wherein the H4 receptor protein is in 

2 a lipid bilayer membrane. 

1 33 . An isolated nucleic acid that specifically hybridizes under highly stringent 

2 conditions to the complement of the sequence depicted in SEQ ID NO:l, wherein said nucleic 

3 acid encodes a histamine H4 receptor protein. 
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Figure 1 



( I ) ATC}CCAGATACrA\TAGCACAATCX^TTTATC.ACTAAGCACTCGTCrTACm 

MPDTNSTI NLSLSTRVTLAF 



TTTATGrCXrrTAGrAGCTTTTOCrATAATOCTAGGAAATGCTTTGGTCAT^ 
FMSLV'AFAI MLGNALVJ LAF 

(12 1) CrGGrGGACAA.^A.ACCTTAGACATCGAAGTAGrTA m U I iCl I .VCTTGOCCATCrCT 
VVDKKLRHRSSYFFLNLAI S 



GALI ICJ I ICrOGGTGrGATCrcX:>.TTCXTTTGrACATaXTCACACarrcr^ 
DFFXGVl SI PLYI PHTLFEW 

J4n G\TTTTGGA.A.AGGvWCrCrGrATTTTGGCTCACrACTCiACrATCrOT 

DFGKEI CVFWLTTO^ LLCTA 



TCTGTAT ATA.ACATTGrCCrCATCAGCTATG4^T0GAT ACXTCTCAGTCTCA^ A 
SVYNI VLI SYDRYLSVSNA\^ 



(361) ACTTATAGAACTCAACATACrGaOCTCTTGAAGATTGrTiCTCrGATOT 

SYRTQHTGVLKI VTLMVAVW 

GTOCTGOOCI I CI I AGrGAATGOOCCAATGATTCrAGrrTCAGAGrcrTGGAAGGATGA.A 
VLAFLVNGPMI LVSESWKDE 



1481) GOT AGTGAATGrGA.AOCTOGAi 1 1 I I Tl OOG^ATGG^ACATOCTTGCX:ATCACATCATTC 
GSECEPGFFSEWYI LAI TS F 



TTOCL\ATTa3TGATCXXj«LGTCATCrrACn*C^^ 
LEFVI PVI LVAYFNMNI YWS 

»60l) C^GTGaAu^GCGTGATCATCTCAGrAGG^GCCA.^AOCCATCXTC^ 

LWKRDHLS RCQS HPGLTA\'S 

TCCAACATCTGTOaACACTCATrCAGAGGrAGACrATCrrCAAGa^ 
SNI CGHSFRGRLSSRRSLS A 

I 72 I ) Ta3ACAGAAGTTCCTGCATOCTTTCATTCAGAG\GACAGA<^ 

STEVPASFHSERQRRKSSLM 

TmOCTCAAGAAOCAAGATGAATAGCAATACAATTOCnr^ 
FSSRTKMNSNTI ASKMGSFS 

(841) CAATCAGATTCrGTAGCTCrrCAOCAAAGOGAACATGITGA^ 

QS DSVALHQREHVEL LRARR 



TTA0CXV^AGrCACrCX30CATTCTCrTAO00Cjri 1 1 lUJIU l i lUJIG CaXTOCATATTCr 
LAKSLAI LLGVFAVCWAPYS 

(961) CTGrTCACAATTGTCXnTTCATTTTATTCCTCAGCAACAG^ 

LFTI VLSFYSSATGPKSVWY 



AGAATTOCATTTTCXKnnrCAGrGGITCAAITCCmGrC^^ 
RI AFWLQWFNSFVNPLLYPL 

(1081) TGTCACAA^XXKXJTCAAAACaJi I I CI I GAAAATATTTT<n'ATAAAAAAGCAAOCT(rr A 
CHKRFQKAFLKI FCI KKQPL 

(1141) CXZATCilACAACACAGrCOCTCAGrATCrrCTTA^ 
PSQHSRSVSS. 
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Figure 3 
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SEQUENCE LISTING 

<110> Jones, Philip 
Shujian, Wu 
Blatcher, Maria 
Pausch, Mark 

<120> Human Histamine H4 Receptor 

<130> 3655/1J100-US3 

<140> TBA 

<141> Concurrently Herewith 

<150> 60/202,151 
<151> 2000-05-05 

<150> 60/227,567 
<151> 2000-08-23 

<150> 60/247,855 
<151> 2000-11-13 

<160> 15 

<170> Patentin version 3.0 

<210> 1 

<211> 1173 

<212> DNA 

<213> Homo sapiens 

<400> 1 



atgccagata 


ctaatagcac 


aatcaattta 


tcactaagca 


ctcgtgttac 


tttagcattt 


60 


tttatgtcct 


tagtagcttt 


tgctataatg 


ctaggaaatg 


ctttggtcat 


tttagctttt 


120 


gtggtggaca 


aaaaccttag 


acatcgaagt 


agttattttt 


ttcttaactt 


ggccatctct 


180 


gacttctttg 


tgggtgtgat 


ctccattcct 


ttgtacatcc 


ctcacacgct 


gttcgaatgg 


240 


gattttggaa 


aggaaatctg 


tgtattttgg 


ctcactactg 


actatctgtt 


atgtacagca 


300 


tctgtatata 


acattgtcct 


catcagctat 


gatcgatacc 


tgtcagtctc 


aaatgctgta 


360 


agttatagaa 


ctcaacatac 


tggggtcttg 


aagattgtta 


ctctgatggt 


ggccgtttgg 


420 


gtgctggcct 


tcttagtgaa 


tgggccaatg 


attctagttt 


cagagtcttg 


gaaggatgaa 


480 


ggtagtgaat 


gtgaacctgg 


atttttttcg 


gaatggtaca 


tccttgccat 


cacatcattc 


540 


ttggaattcg 


tgatcccagt 


catcttagtc 


gcttatttca 


acatgaatat 


ttattggagc 


600 


ctgtggaagc 


gtgatcatct 


cagtaggtgc 


caaagccatc 


ctggactgac 


tgctgtctct 


660 
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tccaacatct 


gtggacactc 


attcagaggt 


agactatctt 


caaggagatc 


tctttctgca 


720 


tcgacagaag 


ttcctgcatc 


ctttcattca 


gagagacaga 


ggagaaagag 


tagtctcatg 


780 


ttttcctcaa 


gaaccaagat 


gaatagcaat 


acaattgctt 


ccaaaatggg 


ttccttctcc 


840 


caatcagatt 


ctgtagctct 


tcaccaaagg 


gaacatgttg 


aactgctcag 


agccaggaga 


900 


ttagccaagt 


cactggccat 


tctcttaggg 


gtttttgctg 


tttgctgggc 


tccatattct 


960 


ctgttcacaa 


ttgtcctttc 


attttattcc 


tcagcaacag 


gtcctaaatc 


agtttggtat 


1020 


agaattgcat 


tttggcttca 


gtggttcaat 


tcctttgtca 


atcctctttt 


gtatccattg 


1080 


tgtcacaagc 


gctttcaaaa 


ggctttcttg 


aaaatatttt 


gtataaaaaa 


gcaacctcta 


1140 


ccatcacaac 


acagtcggtc 


agtatcttct 


taa 






1173 



<210> 2 

<211> 390 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Pro Asp Thr Asn 
1 5 

Thr Leu Ala Phe Phe 
20 

Asn Ala Leu Val lie 
35 

Arg Ser Ser Tyr Phe 
50 

Gly Val lie Ser lie 
65 

Asp Phe Gly Lys Glu 

85 

Leu Cys Thr Ala Ser 
100 

Tyr Leu Ser Val Ser 
115 

Val Leu Lys lie Val 

130 

. Leu Val Asn Gly Pro 



Ser Thr lie Asn Leu 

10 

Met Ser Leu Val Ala 
25 

Leu Ala Phe Val Val 
40 

Phe Leu Asn Leu Ala 
55 

Pro Leu Tyr lie Pro 
70 

lie Cys Val Phe Trp 

90 



Val Tyr Asn lie Val 
105 

Asn Ala Val Ser Tyr 
120 

Thr Leu Met Val Ala 
135 

Met lie Leu Val Ser 



Ser Leu Ser Thr Arg Val 

15 

Phe Ala lie Met Leu Gly 
30 

Asp Lys Asn Leu Arg His 
45 

lie Ser Asp Phe Phe Val 
60 

His Thr Leu Phe Glu Trp 
75 80 

Leu Thr Thr Asp Tyr Leu 

95 



Leu lie Ser Tyr Asp Arg 
110 

Arg Thr Gin His Thr Gly 
125 

Val Trp Val Leu Ala Phe 
140 

Glu Ser Trp Lys Asp Glu 
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145 150 155 160 

Gly Ser Glu Cys Glu Pro Gly Phe Phe Ser Glu Trp Tyr lie Leu Ala 

165 170 175 

lie Thr Ser Phe Leu Glu Phe Val lie Pro Val lie Leu Val Ala Tyr 
180 185 190 

Phe Asn Met Asn lie Tyr Trp Ser Leu Trp Lys Arg Asp His Leu Ser 
195 200 205 

Arg Cys Gin Ser His Pro Gly Leu Thr Ala Val Ser Ser Asn lie Cys 
210 215 220 

Gly His Ser Phe Arg Gly Arg Leu Ser Ser Arg Arg Ser Leu Ser Ala 
225 230 235 240 

Ser Thr Glu Val Pro Ala Ser Phe His Ser Glu Arg Gin Arg Arg Lys 

245 250 255 

Ser Ser Leu Met Phe Ser Ser Arg Thr Lys Met Asn Ser Asn Thr lie 

260 265 270 

Ala Ser Lys Met Gly Ser Phe Ser Gin Ser Asp Ser Val Ala Leu His 
275 280 285 

Gin Arg Glu His Val Glu Leu Leu Arg Ala Arg Arg Leu Ala Lys Ser 
290 295 300 

Leu Ala lie Leu Leu Gly Val Phe Ala Val Cys Trp Ala Pro Tyr Ser 
305 310 315 320 

Leu Phe Thr lie Val Leu Ser Phe Tyr Ser Ser Ala Thr Gly Pro Lys 

325 330 335 

Ser Val Trp Tyr Arg lie Ala Phe Trp Leu Gin Trp Phe Asn Ser Phe 
340 345 350 

Val Asn Pro Leu Leu Tyr Pro Leu Cys His Lys Arg Phe Gin Lys Ala 
355 360 365 

Phe Leu Lys lie Phe Cys lie Lys Lys Gin Pro Leu Pro Ser Gin His 
370 375 380 

Ser Arg Ser Val Ser Ser 
385 390 

<210> 3 

<211> 259 

<212> PRT 

<213> Artificial 

<220> 

<223> Consensus sequence 
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<400> 3 

Gly Asn Leu Leu Val lie Leu Val lie Leu Arg Thr Lys Lys Leu Arg 
1 5 10 15 

Thr Pro Thr Asn lie Phe lie Leu Asn Leu Ala Val Ala Asp Leu Leu 
20 25 30 

Phe Leu Leu Thr Leu Pro Pro Trp Ala Leu Tyr Tyr Leu Val Gly Gly 
35 40 45 

Ser Glu Asp Trp Pro Phe Gly Ser Ala Leu Cys Lys Leu Val Thr Ala 
50 55 60 

Leu Asp Val Val Asn Met Tyr Ala Ser lie Leu Leu Leu Thr Ala lie 
65 70 75 80 

Ser lie Asp Arg Tyr Leu Ala lie Val His Pro Leu Arg Tyr Arg Arg 

85 90 95 

Arg Arg Thr Ser Pro Arg Arg Ala Lys Val Val lie Leu Leu Val Trp 
100 105 110 

Val Leu Ala Leu Leu Leu Ser Leu Pro Pro Leu Leu Phe Ser Trp Val 
115 120 125 

Lys Thr Val Glu Glu Gly Asn Gly Thr Leu Asn Val Asn Val Thr Val 
130 135 140 

Cys Leu lie Asp Phe Pro Glu Glu Ser Thr Ala Ser Val Ser Thr Trp 
145 150 155 160 

Leu Arg Ser Tyr Val Leu Leu Ser Thr Leu Val Gly Phe Leu Leu Pro 

165 170 175 

Leu Leu Val lie Leu Val Cys Tyr Thr Arg lie Leu Arg Thr Leu Arg 
180 185 190 

Lys Ala Ala Lys Thr Leu Leu Val Val Val Val Val Phe Val Leu Cys 
195 200 205 

Trp Leu Pro Tyr Phe lie Val Leu Leu Leu Asp Thr Leu Cys Leu Ser 
210 215 220 

lie lie Met Ser Ser Thr Cys Glu Leu Glu Arg Val Leu Pro Thr Ala 
225 230 235 240 

Leu Leu Val Thr Leu Trp Leu Ala Tyr Val Asn Ser Cys Leu Asn Pro 

245 250 255 

lie lie Tyr 
<210> 4 
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<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Forward primer 

<400> 4 

taacttggcc attgacttct t 



<210> 5 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Reverse Primer 

<400> 5 

attcgaacag catgtgaggg at 



<210> 6 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Probe 
<220> 

<221> modified_base 

<222> (1) . . (1) 

<223> (6-carboxyf luorescein) -t 



<220> 

<221> modified_base 

<222> (27) . • (27) 

<22 3> a- ( 6-carboxy-N , N ^ N ' N ' -tetramethylrhodamine ) 



<400> 6 

tacaaaggaa tggagatcac acccaca 



<210> 7 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
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<400> 7 

aaggatccaa aatgccagat actaatagc 



29 



<210> 8 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 

<400> 8 

aagtcgactt aagaagatac tgaccgac 28 

<210> 9 

<211> 43 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



<210> 10 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 10 

tctagattaa gaagatactg accgactgtg ttg 33 



<210> 11 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



<400> 9 

aagcttccac catgccagat actaatagca caatcaattt ate 



43 



<400> 11 

ggaaggatga aggtagtgaa tg 



22 



<210> 12 
<211> 22 
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<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Primer 


<400> 


12 


cagaatctga ttgggaggaa 

s 


<210> 


13 


<211> 


19 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Oligonucleotide 


<400> 


13 



22 



taacttggcc atctttgac 19 



<210> 


14 


<211> 


18 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Oligonucleotide 


<400> 


14 


attcgaacag cgtgtgag 


<210> 


15 


<211> 


19 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Oligonucleotide 


<220> 




<221> 


modified base 


<222> 


(1) . . (1) 


<223> 


5 ' - ( 6-carboxyf 1 


<400> 


15 



18 



tacaaaggaa tggagatca 19 
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HUMAN HISTAMINE H4 RECEPTOR 

This application claims priority under 35 U.S.C. § 119 from provisional patent 
application Serial Nos. 60/202, 151, filed May 5, 2000, 60/227,567, filed August 23, 2000, and 
60/247,855, filed November 13, 2000; which all are hereby incorporated by reference in their 
entireties. 

FIELD OF THE INVENTION 
The present invention discloses the identification of a novel histamine receptor, 

termed H4. 

BACKGROUND OF THE INVENTION 
Histamine produces numerous physiological effects in the body through 
interaction with one of three different cell surface receptors, classified as H,, H2, and H3. 
These receptors belong to the guanine nucleotide binding protein coupled receptor (G-protein 
coupled receptors, GPCR) class. 

Stimulation of histamine Hj receptors produces symptoms that are typically 
associated with physiological responses to allergic stimuli (Ash and Schild, Br. J. PharmacoL 
1966, 27:427). These effects are blocked by Hj antagonists such as, for example, 
diphenhydramine. H, antagonists are generally defined as "classical antihistamines". Classical 
antihistamines are the active ingredient in most over-the-counter allergy medications. 
Pharmacological studies indicate that agonist activation of these receptors stimulates the inositol 
phosphate pathway, and thus stimulates formation of inositol triphosphate (IP3) and 
diacylglycerol (DAG). 

Histamine H2 receptors have been shown to play a role in gastric acid secretions 
(Black et al. Nature 1972, 236:385). Histamine Hj receptor antagonists such as, for example, 
cimetidine and ranitidine, are often the active ingredient in over-the-counter and prescription 
drugs that are used to treat duodenal ulcers, gastric ulcers, heartburn, indigestion, and other 
disorders of the gastrointestinal tract. Activation of histamine Hj stimulates adenylyl cyclase 
activity and stimulates formation of cAMP. 



• t 

• J 
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Histamine H3 receptors are a relatively new member of the histamine receptor 
class. These receptors were originally described as histamine autoreceptors. These receptors 
were shown to be located on presynaptic histamine nerve termin£ds and to control the rate of 
histamine synthesis and release in the brain (Arrangef^ar/., Nature 1983,302:832). Recentstudies 
5 indicate that H3 receptors also are present on non-histamine presynaptic nerve terminals. H3 
receptors may modulate the synthesis and release of other neurotransmitters such as dopamine, 
serotonin, acetylcholine, and norepinephrine. H3 receptors also have been found in peripheral 
tissues. Histamine activation of H3 receptor inhibits adenylyl cyclase activity and formation of 
cAMP. 

10 Multiple pharmacological studies have indicated the presence of H3 receptor 

subtypes (Leurs et al JPET 1996, 276:1009-1015; Gumming and Gjede, Brain Res. 1994, 
641:203-207; Calpham and Kilpatrick, Br. J. Pharmacol 1996, 107:919-923; Schworer et al 
Naunyn-Schmiedeberg's Arch. Pharmacol. 1994, 350:375-379; Schkicker et al, Naunyn- 
Schmiedeberg's Arch. Pharmacol. 1996, 353:482-488). Additionally, pharmacological 

1 5 characterization of a histamine receptor on eosinophils describes the greater potency of histamine 
compared to R-a-methylhistamine (Raible, et al Am. J. Respir. Crit. Care.Med 1 994 , 1 49: 1 506- 
1511.). However, the existence of these receptor subtj^es has yet to be substantiated by 
molecular biological techniques. 

20 SUMMARY OF THE INVENTION 

The present invention contemplates an isolated histamine H4 receptor protein 
having an amino acid sequence at leeist 5 1 % identical or comprising at least 1 0 contiguous amino 
acids from the sequence depicted in SEQ ID NO:2. The H4 receptor protein binds ligands 
comprising an imidazole and an amine, which imidazole and amine are attached by an alkyl 

25 chain, where the rank order of efiScacy of modulation of second messenger formation of the 
ligands at the H4 receptor protein is 5>6=10>8=4, where the nimiber represents the nximber of 
carbons in the alkyl chain. In one embodiment, upon binding histamine or a histamine agonist 
the receptor protein inhibits second messenger formation. Preferably the second messenger is 
cAMP. 

30 The present application also discloses an isolated H4 receptor protein having an 

amino acid sequence with at least 95% sequence identity to hxmian H4 receptor protein having 
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an amino acid sequence as depicted in SEQ ID NO: 2. In a specific embodiment, the protein is 
a human H4 receptor protein, e,g.^ having an amino acid sequence as depicted in SEQ ID NO:2 
or an allelic variant of that sequence. 

Also provided are nucleic acids, e.^., cDNAs encoding the H4 receptor proteins 
5 as discussed above. In another embodiment, an isolated nucleic acid encoding an H4 receptor 
protein, which nucleic acid hybridizes imder stringent conditions to a nucleic acid having a 
sequence of at least 20 nucleotides identical to a corresponding nucleotide sequence of the same 
number of bases in SEQ ID NO :l or its complement. 

The isolated nucleic acids encoding H4 receptors can be part of vectors, e.g,^ for 
10 cloning, expression, and/or expansion. An expression vector comprises the nucleic acid 
encoding the H4 receptor protein operably associated with an expression control sequence. The 
invention further provides host cells and non-human transgenic animals containing such an 
expressible vector, and methods for producing an H4 receptor polypeptide using such host cells. 

In addition, the invention provides an isolated nucleic acid, such as a primer or 
1 5 probe, of at least 1 0 bases having a nucleotide sequence identical to a corresponding nucleotide 
sequence of the same number of bases in SEQ ID NO: 1, or its complement. The invention also 
provides an antibody that specifically binds an H4 receptor. 

The present invention further contemplates a method for detecting expression of 
H4 receptor, which method comprises detecting mRNA encoding H4 receptor in a sample from 
20 a cell suspected of expressing H4 receptor or detecting the H4 receptor protein with an antibody 
of the invention. 

The present invention also contemplates an assay system for identifying H4 
receptor ligands. The assay system comprises a sufficient number of transformed host cells to 
be able to detect an alteration in second messenger accvimxalation. Preferably, the second 

25 messenger is cAMP. 

The present invention also contemplates a method for identifying a test compound 
that antagonizes or agonizes histamine H4 receptors. The method comprises detecting an 
alteration in the level of a second messenger in the assay system contacted with the test 
compound. In the method, an increase in the level of the second messenger indicates that the test 

30 compound suitagonizes the H4 receptor. A decrease in the level of the second messenger 
indicates that the test compound agonizes the H4 receptor. 
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The present invention also discloses an isolated nucleic acid that specifically 
hybridizes under highly stringent conditions to the complement of the sequence depicted in SEQ 
ID NO: I , where the nucleic acid encodes a histamine H4 receptor protein. 

5 BRIEF DESCRIPTION OF DRAWINGS 

Figure 1. Sequence of the human H4 cDNA (SEQ ID NO: 1) and the deduced 
amino acid sequence of the protein it encodes (SEQ ID NO:2). The TM domains are denoted by 
lines. 

Figures 2A, 2B, and 2C. (A) Agonist assay. Square: histamine; triangle: R-a- 
1 0 methylhistamine; inverted triangle: clobenpropit; diamond: thioperamide. (B) Antagonist assay. 
Assay were conducted in duplicate and presented as the average ± SEM. (C) Activity in 
mammedian cells. 

Figure 3. Antagonist activity of a series of histamine analogs. Square: 
clobenpropit; triangle: thioperamide. Assay were conducted in duplicate and presented as the 
1 5 average ± SEM.' 

Figure 4. H4 expression in from 6 human T cell clones derived from a single 
human donor. Three of these clones were CD4+ cell clones (RG4.3B, RG4.3 A, and RG4.3), 
two v^ere CD8+ cell clones (RG8.1C and RG8.1A), and one was an NKT cell clone (RGl). 

Figure 5. H4 expression in bulk populations of CD4+ and CD8+ T cells isolated 
20 from three different individuals. B-CD8 and B-CD4 were CD8+ and CD4+ cells, respectively, 
obtained from individual 1. L-CD8 and L-CD4 were CD8+ and CD4+ cells, respectively, 
obtained from individual 2. U-CD8 and U-CD4 were CD8+ and CD4+ cells, respectively, 
obtained from individual 3. 

25 DETAILED DESCRIPTION 

The present invention is based, in part, on discovery of a novel histamine receptor, 
which has been termed The new histamine receptor was cloned from a human heart library 
using H3 specific primers, but proved to be a new histamine receptor. Yeast and human 
endothelial kidney (HEK) cells were transformed with a human H4 expression vector. 

30 Transformed cells in multi-well plates were treated with test compounds, and regulation of the 
intracellular second messenger cyclic adenosine monophosphate (cAMP) formation was 
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determined. Modulation of cAMP formation is iigand and concentration dependent. 

The nucleic acid and protein sequences of H4 shows homology to known G- 
protein coupled receptors. Specifically, the H4 protein shows homology to biogenic-amine G- 
protein coupled receptors. As in other biogenic amine receptors, the present protein contains 
5 conserved aspartate residues within transmembrane (TM) domains 2 and 3 (at positions 61 and 
94). A DY motif (D94 and Y95) in TM3 is found only in histamine and muscarinic receptors. 
A DRY sequence motif that exists at the cytoplasmic interface of TM3 of all GPCRs also can be 
found in the present sequence (amino acids 111- 113), Also present are two conserved cysteine 
residues in the first and second extracellular loops (position 88 and 164), which are predicted 

10 to form a disulfide bond; two conserved tryptophan residues in TM4 and TM6 (positions 1 40 and 
316); and conserved proline residues in TM5, TM6, and TM7 (positions 186, 318, and 355, 
respectively). The putative TM domains are shown in Figure 1 . H4 shares sequence identity 
with human H3 receptors. Sequence comparison indicates an overall sequence identity of about 
44% and overall sequence similarity of about 51%. 

15 The present invention also contemplates an assay method and system for 

identifying selective H4 receptor ligands. The method involves detecting binding of a test 
compound to isolated cell membranes containing the histamine H4 receptor. The assay system 
comprises transformed host cells that express H4 receptors, where the number of cells in the assay 
system is sufficient to detect an alteration in second messenger accumulation. The test system 

20 also includes an appropriate cell culture medium to permit cell growth and viability, and 
preferably tissue culture plates or arrays containing the host cells in cell culture medium. In a 
specijSc embodiment, the second messenger that is detected is cAMP. In a further embodiment, 
the receptor is a human receptor. 

The invention also discloses a method for identifying a test compound that 

25 antagonizes or agonizes histamine H4 receptors. The method comprises detecting an increase 
(antagonist) or decrease (agonist) in the level of a second messenger in the assay system when 
contacted with the test compound. 

Thus, the present invention advantageously provides H4 protein, including 
fragments, derivatives, and analogs of H4; H4 nucleic acids, including oligonucleotide primers 

30 and probes, and H4 regulatory sequences (especially an H4 primer and splice sites with introns); 
H4-specific antibodies; and related methods of using these materials to detect the presence of H4 
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proteins or nucleic acids, H4 binding partners, and in screens for agonists and antagonists of H4. 

The following sections of the application, which are delineated by headings (in 
bold) and sub-headings (in bold italics), which cover these three aspects of the invention, are 
5 provided for clarity, and not by way of limitation. 

General Definitions 
As used herein, the term "isolated" means that the referenced material is removed 
from the environment in which it is normally found. Thus, an isolated biological material can 
be free of cellular components, le., components of the cells in which the material is found or 

1 0 produced in nature. In the case of nucleic acid molecules, an isolated nucleic acid includes a 
PGR product, an isolated mRNA, a cDNA, or a restriction fragment. In another embodiment, 
an isolated nucleic acid is preferably excised from the chromosome in which it may be found, 
and more preferably is no longer joined to non-regulatory, non-coding regions, or to other genes, 
located upstream or downstream of the gene contained by the isolated nucleic acid molecide 

1 5 when found in the chromosome. In yet another embodiment, the isolated nucleic acid lacks one 
or more introns. Isolated nucleic acid molecules include sequences inserted into plasmids, 
cosmids, artificial chromosomes, and tlie like. Thus, in a specific embodiment, a recombinant 
nucleic acid is an isolated nucleic acid. An isolated protein may be associated with other proteins 
or nucleic acids, or both, wdth which it associates in the cell, or with cellular membranes if it is 

20 a membrane-associated protein. A protein expressed from a vector in a cell, particularly a cell 
in which the protein is normally not expressed is also a regarded as isolated. An isolated 
organelle, cell, or tissue is removed from the anatomical site in which it is found in a cell or an 
organism. An isolated material may be, but need not be, purified. 

The term "purified" as used herein refers to material that has been isolated under 

25 conditions that reduce or eliminate the presence of unrelated materials, i.e., contaminants, 
including native materials from which the material is obtained. For example, a purified protein 
is preferably substantially free of other proteins or nucleic acids with which it is associated in a 
cell; a purified nucleic acid molecule is preferably substantially free of proteins or other unrelated 
nucleic acid molecules with which it can be foimd within a cell. As used herein, the term 

30 "substantially free" is used operationally, in the context of analytical testing of the material. 
Preferably, purified material substantially fi:ee of contanunants is at least 50% pure; more 
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preferably, at least 90% pure; and more preferably still at least 99% pure. Purity can be evaluated 
by chromatography, gel electrophoresis, immunoassay, composition analysis, biological assay, 
and other methods known in the art. 

Methods for purification are well-known in the art. For example, nucleic acids 
5 can be purified by precipitation, chromatography (including preparative solid phase 
chromatography, oligonucleotide hybridization, and triple helix chromatography), 
ultracentrifugation, and other means. Polypeptides and proteins can be purified by various 
methods including, without limitation, preparative disc-gel electrophoresis, isoelectric focusing, 
HPLC, reversed-phase HPLC, gel filtration, ion exchange and partition chromatography, 

10 precipitation and salting-out chromatography, extraction, and countercurrent distribution. For 
some purposes, it is preferable to produce the polypeptide in a recombinant system in which the 
protein contains an additional sequence tag that facilitates purification, such as, but not limited 
to, a polyhistidine sequence, or a sequence that specifically binds to an antibody, such as FLAG 
and GST. The polypeptide can then be purified firom a crude lysate of the host cell by 

1 5 chromatography on an appropriate solid-phase matrix. Alternatively, antibodies produced against 
the protein or against peptides derived therefrom can be used as purification reagents. Cells can 
be purified by various techniques, including centrifugation, matrix separation {e,g,, nylon wool 
separation), panning and other immunoselection techniques, depletion (e.g., complement 
depletion of contaminating cells), and cell sorting {e.g., fluorescence activated cell sorting 

20 [FACS]). Other purification methods are possible. A purified material may contain less than 
about 50%, preferably less than about 75%, and most preferably less than about 90%, of the 
cellular components with which it was originally associated. The "substantially pure" indicates 
the highest degree of purity which can be achieved using conventional purification techniques 
known in the art. 

25 In a specific embodiment, the term "about" or "approximately" means within a 

scientifically acceptable error range for a given value relative to the precision with which the 
value is or can be measured, e.g., vsathin 20%, preferably within 1 0%, and more preferably within 
5% of a given value or range. Altematively, particularly with biological systems, the term can 
mean within an order of magnitude, preferably within 5-fold and more preferably within 2-fold 

30 of a given value. 

A "sample" as used herein refers to a biological material which can be tested for 
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the presence of protein or H4 nucleic acids. Such samples can be obtained from animal 
subjects, such as humans and non-human animals, and include tissue, especially muscle, 
biopsies, blood and blood products; plural effusions; cerebrospinal fluid (CSF); ascites fluid; and 
cell culture. 

5 Non-human animals include, without limitation, laboratory animals such as mice, 

rats, rabbits, hamsters, guinea pigs, etc.; domestic animals such as dogs and cats; and, farm 
animals such as sheep, goats, pigs, horses, and cows. 

The use of italics indicates a nucleic acid molecule; normal text indicates the 
polypeptide or protein. 

10 The term "ligand" refers to a compound that recognizes and binds to a receptor 

binding site. In a specific embodiment, the ligand binds to the histamine H4 receptors of the 
invention. Upon binding to the receptor, the ligand may produce agonist or antagonist functional 
effects. 

The term "agonist" refers to a ligand that binds to the receptor and produces a 
1 5 functional effect similar to that produced by the endogenous ligand for the receptor. In a specific 
embodiment, the agonist at the histamine H4 receptor produces an effect similar to that produced 
by histamine, the endogenous ligand (histamine) for the H4 receptor. Examples of such agonists 
include, but are not limited to, R-a-methyl histamine and imetit. 

The term "antagonist'' refers to a ligand that binds to the receptor and blocks a 
20 functional effect produced by an agonist for the receptor or the endogenous ligand of the receptor. 
Examples of such antagonists include, but are not limited to, thioperamide. 

The term "selective" refers to the ability of a histamine H4 agonist or antagonist 
to elicit a response from the H4 receptor while eliciting minimal responses from another receptor. 
Stated differently, a selective H4 agonist may be a potent agonist for the H4 receptor while 
25 agonizing another receptor, such as another G-protein coupled receptor and particularly another 
histamine receptor, poorly or not at all. 

The term "ability to elicit a response" refers to the ability of a H4 agonist or 
antagonist ligand to agonize or antagonize H4 receptor activity. 

As vised herein the term "transformed cell" refers to a modified host cell that 
30 expresses a functional H4 receptor expressed from a vector encoding the histamine receptor. 
Any cell can be used, preferably a mammalian cell, and more preferably a HEK cell. 
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A "functional histamine receptor" is a receptor that binds histamine or H4 agonists 
and transduces a signal upon such binding. Preferably the H4 receptor is a human H4. Preferably, 
the signal that is transduced is accumulation of a second messenger, preferably cAMP. 
Histamine H4 receptors may be derived from a variety of sources, including mammal, e,g., 
5 human, bovine, porcine, and canine; and avian. 

The cells of the invention are particularly suitable for an assay system for 
histamine H4 receptor ligands that modulate second messenger accumulation. An "assay system" 
is one or more collections of such cells, e,g, , in a microwell plate or some other culture system. 
To permit evaluation of the effects of a test compound on the cells, the number of cells in a single 
10 assay system is sufficient to express a detectable amount of the regulated second messenger 
accumulation at least under conditions of maximum second messenger acc\miulation. 

A "second messenger" is an intracellular molecule or ion, where formation and/or 
accumulation of the second messenger is regulated by activation of cellular membranes. In one 
embodiment, cellular membranes contain G-protein coupled receptor, ion chaimels, and tyroinse 
1 5 kinase receptors. In the context of this invention, the cellular membrane is a G-protein coupled 
receptor, preferably a histamine H4 receptor. In a specific embodiment, the second messenger 
is one or more of cAMP, cGMP, inositol phosphate, DAG, and ions such as calciimi and 
potassium. Preferably, the second messenger is cAMP. 

A "test compound" or "candidate compound" is any molecule that can be tested 
20 for its ability to bind H4 receptors, and preferably modulate second messenger accumulation 
through the H4 receptor, as set forth herein. A compound that binds, and preferably modulates 
H4 is a "lead compoimd" suitable for further testing and development as an H4 agonist or 
antagonist. 

As used herein, the term "provide" refers to supplying the compounds or 
25 pharmaceutical compositions of the present invention to cells or to an animal, preferably a 
human, in any form. For example, a prodrug form of the compounds may be provided the subject, 
which then is metabolized to the compound in the body. 



BNSDCXID: <WO_01B5786A3_IA> 



wo 01/085786 PCT/USOl/14527 . 

-10- 
H>| Receptor 

H4 receptor, as defined herein, refers to a polypeptide having about 390 amino 

acids. 

The protein is transcribed from a nucleic acid sequence that is about 11 73 base pairs in length. 
5 The H4 protein has significant homology to the H3 receptor. Thus, H4 refers to orthologs and 
allelic variants, e,g.^ a protein having greater than about 50%, preferably greater than 80%, more 
preferably still greater than 90%, and even more preferably greater than 95% overall sequence 
identity to SEQ ID NO: 2. Allelic variants may differ from 1 to about 5 amino residues from 
SEQ ID NO:2. In a specific embodiment, H4 has an amino acid sequence as shown in SEQ ED 
10 NO: 2. 

Sequence comparison studies between human H4 protein and human H3 protein, 
indicates sequence identity of about 44%. The predicted protein sequence contains residues 
which are characteristic of the class of biogenic amine receptors. As in other biogenic amine 
receptors, the present protein contains conserved aspartate residues within TM2 and TM3 (at 

15 positions 61 and 94). A DY motif, found only in histamine and muscarinic receptors, is found 
at positions 94-95. A DRY sequence motif that exists at the cytoplasmic interface of TM3 of all 
GPCRs also can be fovmd in amino acids 111-113. Also present are two conserved cysteine 
residues in the first and second extracellular loops (position 88 and 164), two conserved 
tryptophan residues in TM4 and TM6 (positions 140 and 316); and conserved proline residues 

20 m TM5, TM6, and TM7 (positions 1 86, 31 8, and 355, respectively). 

H4 receptors, like and H3 receptors, modulates adenylyl cyclase activity. 
Therefore, the receptor modulates accumulation of the intracellular messenger cAMP. 
Modulation of H4 receptors may be a treatment for transplanted organ rej ection, asthma, allergies, 
and autoimmune pathologies, such as, but not limited to, multiple sclerosis, type I diabetes, and 

25 rheumatoid arthritis. Additionally, CNS functions such as cognitive and memory defects may 
be treated. 

H4 fragments, derivatives, and analogs can be characterized by one or more of 
the characteristics of H4 protein. In a specific embodiment, in order to develop the specific 
C-terminal and N-terminal H4 antibodies, antibodies can be raised against extracellular or 
30 cytoplasmic portions of the H4 protein, preferably or antigenic peptides identified using a 
hydrophobicity profile or other algorithms. 
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Analogs and derivatives of the H4 receptor of the invention have the same or 
homologous characteristics of H4 as set forth above. For example, a truncated form of H4 can be 
provided. Such a truncated form includes H4 with a either an N-terminal, C-terminal, or internal 
deletion. In a specific embodiment, the derivative is functionally active, Le., capable of 
5 exhibiting one or more functional activities associated with a full-length, wild-type H4 of the 
invention. Such functions include, but are not limited to, inhibition of adenylyl cyclase activity 
and cAMP formation. Altematively, a H4 chimeric fusion protein can be prepared in which the 
H4 portion of the fusion protein has one or more characteristics of H4. Such fusion proteins 
include fusions of the H4 receptor with a marker polypeptide, such as FLAG, a histidine tag, a 

1 0 myc tag, or glutathione-S-transferase (GST). Alternatively, the H4 receptor can be fused with an 
expression-related peptide, such as yeast a-mating factor, a heterogeneous signal peptide, or a 
peptide that renders the protein more stable upon expression. H4 can also be fused with a unique 
phosphorylation site for labeling. 

H4 analogs can be made by altering encoding nucleic acid sequences by 

15 substitutions, additions or deletions that provide for functionally similar molecules, Le., 
molecules that perform one or more H4 functions. In a specific embodiment, an analog of H4 is 
a sequence-conservative variant of H4. In another embodiment, an analog of H4 is a function- 
conservative variant. In yet another embodiment, an analog of H4 is an allelic variant or a 
homologous variant from another species. In a specific embodiment, human variants of H4 are 

20 described. 

H4 derivatives include, but are by no means limited to, phosphoiylated H,, 
glycosylated H4, methylated H4, and other H4 proteins that are otherwise chemically modified. 
H4 derivatives also include labeled variants, e.g., radio-labeled with iodine (or, as pointed out 
above, phosphorous); a detectable molecule, such as but by no means limited to biotin, a 
25 chelating group complexed with a metal ion, a chromophore or fluorophore, a gold colloid, or 
a particle such as a latex bead; or attached to a water soluble polymer. 

Cloning and Expression of 
The present invention contemplates analysis and isolation of a gene encoding a 
30 functional or mutant H4, including a full length, or naturally occurring form of H4, and any 
antigenic fragments thereof from any source, preferably himian. It further contemplates 
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expression of functional or mutant H4 protein for evaluation, diagnosis, or therapy. 

In accordance with the present invention there may be employed conventional 
molecular biology, microbiology, and recombinant DNA techniques within the skill of the art. 
Such techniques are explained fully in the literature. See, e.g., Sambrook, Fritsch & Maniatis, 

5 Molecular Cloning: A Laboratory Manual, Second Edition (1989) Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York (herein "Sambrook et al, 1989"); DNA 
Cloning: A Practical Approach, Volumes I and n (D.N. Glover ed. 1985); Oligonucleotide 
Synthesis (MJ. Gait ed. 1984); Nucleic Acid Hybridization [B.D. Hames & SJ. Higgins eds. 
(1985)]; Transcription And Translation [B.D. Hames & S J. Higgins, eds. (1984)]; Animal Cell 

10 Culture [R.I. Freshney, ed. (1986)]; Immobilized Cells And Enzymes [IRL Press, (1986)]; 
B.Perbal, A Practical Guide To Molecular Cloning (1984); P.M. Ausubel et al (eds.). Current 
Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994). 

Molecular Biology - Definitions 
"Amplification" of DNA as used herein denotes the use of polymerase chain 

1 5 reaction (PCR) to increase the concentration of a particular DNA sequence within a mixture of 
DNA sequences. For a description of PCR see Saiki et al. Science, 239:487, 1988. 

As used herein, "sequence-specific oligonucleotides" refers to related sets of 
oligonucleotides that can be used to detect allelic variations or mutations in the H4 gene. 

The nucleic acid molecules (polynucleotides) herein may be flanked by natural 

20 regulatory (expression control) sequences, or may be associated with heterologous sequences, 
including promoters, internal ribosome entry sites (IRES) and other ribosome binding site 
sequences, enhancers, response elements, suppressors, signal sequences, polyadenylation 
sequences, introns, 5 - and 3 - non-coding regions, and the like. The nucleic acids may also be 
modified by many means known in the art. Non-limiting examples of such modifications include 

25 methylation, "caps", substitution of one or more of the naturally occurring nucleotides with an 
analog, and intemucleotide modifications such as, for example, those with imcharged linkages 
(e.g., methyl phosphonates, phosphotriesters, phosphoroamidates, carbamates, etc.) and with 
charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.). Polynucleotides may 
contain one or more additional covalently linked moieties, such as, for example, proteins (e.g., 

30 nucleases, toxins, antibodies, signal peptides, poly-L-lysine, etc.), intercalators (e.g., acridine, 
psoralen, etc.), chelators (e.g., metals, radioactive metals, iron, oxidative metals, etc.), and 
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alkylators. The polynucleotides may be derivatized by formation of a methyl or ethyl 
phosphotriester or an alkyl phosphoramidate linkage. Fxirthermore, the polynucleotides herein 
may also be modified with a label capable of providing a detectable signal, either directly or 
indirectly. Exemplaiy labels include radioisotopes, fluorescent molecules, biotin, and the like. 
5 A "coding sequence" or a sequence "encoding" an expression product, such as a 

RNA, polypeptide, protein, or enzyme, is a nucleotide sequence that, when expressed, results in 
the production of that RNA, polypeptide, protein, or enzyme, Le., the nucleotide sequence 
encodes an amino acid sequence for that polypeptide, protein or enzyme. A coding sequence for 
a protein may include a start codon (usually ATG) and a stop codon. 

1 0 The term "gene", also called a "structural gene" means a DNA sequence that codes 

for or corresponds to a particular sequence of amino acids which comprise all or part of one or 
more proteins or enzymes, and may or may not include introns and regulatory DNA sequences, 
such as promoter sequences, 5 -imtranslated region, or 3 -untranslated region which affect for 
example the conditions vmder which the gene is expressed, 

15 A "promoter sequence" is a DNA regulatory region capable of binding RNA 

polymerase in a cell and initiating transcription of a downstream (3' direction) coding sequence. 
For purposes of defining the present invention, the promoter sequence is bounded at its 3' 
terminus by the transcription initiation site and extends upstream (5' direction) to include the 
minimum nimiber of bases or elements necessary to initiate transcription at levels detectable 

20 above background. Within the promoter sequence will be found a transcription initiation site 
(conveniently defined for example, by mapping with nuclease SI), as well as protein binding 
domains (consensus sequences) responsible for the binding of RNA polymerase. The present 
invention includes the H4 receptor gene promoter foimd in the genome, which can be operatively 
associated with a H4 coding sequence with a heterologous coding sequence. 

25 The term "host cell" means any cell of any organism that is selected, modified, 

transformed, grown, or used or manipulated in any way, for the production of a substance by the 
cell, for example the expression by the cell of a gene, a DNA or RNA sequence, a protein or an 
enzyme. Host cells can further be used for screening or other assays, as described infra. 

A coding sequence is "under the control of or "operatively associated with" 

30 transcriptional and translational control sequences in a cell when RNA polymerase transcribes 
the coding sequence into mRNA, which is then trans-RNA spliced (if it contains introns) and 
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translated, in the case of mRNA, into the protein encoded by the coding sequence. 

The terms "express" and "expression" mean allowing or causing the information 
in a gene or DNA sequence to become manifest, for example producing a protein by activating 
the cellvilar functions involved in transcription and translation of a corresponding gene or DNA 
5 sequence. A DNA sequence is expressed in or by a cell to form an "expression product" such 
as a protein. The expression product itself , e.g., the resulting protein, may also be said to be 
"expressed" by the cell An expression product can be characterized as intracellular, 
extracellular, transmembrane, or secreted. The term "intracellular" means something that is 
inside a cell. The term "extracellular" means something that is outside a cell. The H4 receptor 
10 is a seven transmembrane protein with intracellular and extracellular domains. A substance is 
"secreted" by a cell if it appears in significant measure outside the cell, firom somewhere on or 
inside the cell. 

The terai "transfection" means the introduction of a foreign nucleic acid into a 
cell. The term "transfomiation" means the introduction of a "foreign" (z.e., extrinsic or 

15 extracellular) gene, DNA or RNA sequence to a host cell, so that the host cell v^U express the 
introduced gene or sequence to produce a desired substance, typically a protein or enzyme coded 
by the introduced gene or sequence. The introduced gene or sequence may also be called a 
"cloned" or "foreign" gene or sequence, may include regulatory or control sequences, such as 
start, stop, promoter, signal, secretion, or other sequences used by a cell's genetic machinery. 

20 The gene or sequence may include nonfunctional sequences or sequences with no known 
function. A host cell that receives and expresses introduced DNA or RNA has been 
'transformed" and is a "transformant" or a "clone." The DNA or RNA introduced to a host cell 
can come from any source, including cells of the same genus or species as the host cell, or cells 
of a different genus or species. 

25 The terms "vector", "cloning vector" and "expression vector" mean the vehicle 

by which a DNA or RNA sequence (e.g., a foreign gene) can be introduced into a host cell, so 
as to transform the host and promote expression (e.g., transcription and translation) of the 
introduced sequence. Vectors include plasmids, phages, viruses, etc.; they are discussed in 
greater detail below. 

30 Vectors typically comprise the DNA of a transmissible agent, into which foreign 

DNA is inserted. A common way to insert one segment of DNA into another segment of DNA 
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involves the use of enzymes called restriction enzymes that cleave DNA at specific sites 
(specific groups of nucleotides) called restriction sites. A "cassette" refers to a DNA coding 
sequence or segment of DNA that codes for an expression product that can be inserted into a 
vector at defined restriction sites. The cassette restriction sites are designed to ensure insertion 
5 of tiie cassette in the proper reading frame. Generally, foreign DNA is inserted at one or more 
restriction sites of the vector DNA, and then is carried by the vector into a host cell along with 
the transmissible vector DNA. A segment or sequence of DNA having inserted or added DNA, 
such as an expression vector, can also be called a "DNA construct." A common type of vector 
is a "plasmid", which generally is a self-contained molecule of double-stranded DNA, usually 

10 of bacterial origin, that can readily accept additional (foreign) DNA and which can readily 
introduced into a sxxitable host cell. A plasmid vector often contains coding DNA and promoter 
DNA and has one or more restriction sites suitable for inserting foreign DNA. Coding DNA is 
a DNA sequence that encodes a particular amino acid sequence for a particular protein or 
enzyme. Promoter DNA is a DNA sequence which initiates, regulates, or otherwise mediates or 

1 5 controls the expression of the coding DNA. Promoter DNA and coding DNA may be firom the 
same gene or from different genes, and may be from the same or different organisms. A large 
number of vectors, including plasmid and fungal vectors, have been described for replication 
and/or expression in a variety of eukaryotic and prokaryotic hosts. Non-limiting examples 
include pKK plasmids (Clonetech), pUC plasmids, pET plasmids (Novagen, Inc., Madison, WI), 

20 pRSET or pREP plasmids (Invitrogen, San Diego, CA), or pMAL plasmids (New England 
Biolabs, Beverly, MA), and many appropriate host cells, using methods disclosed or cited herein 
or otherwise known to those skilled in the relevant art. Recombinant cloning vectors will often 
include one or more replication systems for cloning or expression, one or more markers for 
selection in the host, e.g., antibiotic resistance, and one or more expression cassettes. 

25 The term "expression system" means a host cell and compatible vector imder 

suitable conditions, e.g., for the expression of a protein coded for by foreign DNA carried by the 
vector and introduced to the host cell. Common expression systems include E. coli host cells and 
plasmid vectors, insect host cells and Baculovims vectors, and mammalian host cells and vectors. 

30 The term "heterologous" refers to a combination of elements not naturally 

occurring. For example, heterologous DNA refers to DNA not naturally located in the cell, or 
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in a chromosomal site of the cell. Preferably, the heterologous DNA includes a gene foreign to 
the cell. A heterologous expression regulatory element is such an element operatively associated 
with a different gene than the one it is operatively associated with in nature. In the context of the 
present invention, an H4 gene is heterologous to the vector DNA in which it is inserted for 
5 cloning or expression, and it is heterologous to a host cell containing such a vector, in which it 
is expressed, e.g., a HEK cell. 

The terms "mutant" and "mutation" mean any detectable change in genetic 
material, e.g., DNA, or any process, mechanism, or result of such a change. This includes gene 
mutations, in which the structure (e.g., DNA sequence) of a gene is altered, any gene or DNA 

10 arising from any mutation process, and any expression product (e.g., protein or enzyme) 
expressed by a modified gene or DNA sequence. The term * Variant" may also be used to indicate 
a modified or altered gene, DNA sequence, enzyme, cell, etc., Le., any kind of mutant 

"Sequence-conservative variants" of a polynucleotide sequence are those in which 
a change of one or more nucleotides in a given codon position results in no alteration in the 

15 amino acid encoded at that position. 

"Function-conservative variants" are those in which a given amino acid residue 
in a protein or en2yme has been changed without altering the overall conformation and function 
of the polypeptide, including, but not limited to, replacement of an amino acid with one having 
similar properties (such as, for example, polarity, hydrogen bonding potential, acidic, basic, 

20 hydrophobic, aromatic, and the like). Amino acids with similar properties are well known in the 
art. For example, arginine, histidine and lysine are hydrophilic-basic amino acids and may be 
interchangeable. Similarly, isoleucine, a hydrophobic amino acid, may be replaced with leucine, 
methionine or valine. Such changes are expected to have little or no effect on the apparent 
molecular weight or isoelectric point of the protein or polypeptide. Amino acids other than those 

25 indicated as conserved may differ in a protein or en2yme so that the percent protein or amino acid 
sequence similarity between any two proteins of similar function may vary and may be, for 
example, from 70% to 99% as determined according to an alignment scheme such as by the 
Cluster Method, wherein similarity is based on the MEGALIGN algorithm. A 
"function-conservative variant" also includes a polypeptide or enzyme which has at least 60 % 

30 amino acid identity as determined by BLAST or FASTA algorithms, preferably at least 75%, 
most preferably at least 85%, and even more preferably at least 90%, and which has the same or 
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substantially similar properties or functions as the native or parent protein or enzyme to which 
it is compared. 

As used herein, the term "homologous" in all its grammatical forms and spelling 
variations refers to the relationship between proteins that possess a "common evolutionary 
5 origin," including proteins from superfamilies (e.g., the immunoglobulin superfandly) and 
homologous proteins from different species (e.g., myosin light chain, etc.) (Reeck et aL, Cell 
1987, 50:667). Such proteins (and their encoding genes) have sequence homology, £ts reflected 
by their sequence similarity, whether in terms of percent similarity or the presence of specific 
residues or motifs at conserved positions. 

1 0 Accordingly, the term "sequence similarity" in all its grammatical forms refers to 

the degree of identity or correspondence between nucleic acid or amino acid sequences of 
proteins that may or may not share a common evolutionary origin (see Reeck et aL, supra). 
However, in common usage and in the instant application, the term "homologous," when 
modified with an adverb such as "highly," may refer to sequence similarity and may or may not 

1 5 relate to a common evolutionary origin. 

hi a specific embodiment, two DNA sequences are "substantially homologous" 
or "substantially similar" when at least about 80%, and most preferably at least about 90 or 95% 
of the nucleotides match over the defined length of the DNA sequences, as determined by 
sequence comparison algorithms, such as BLAST, FASTA, DNA Strider, etc. An example of 

20 such a sequence is an allelic or species variant of the specific H4 gene of the invention. 
Sequences that are substantially homologous can be identified by comparing the sequences using 
standard software available in sequence data banks, or in a Southem hybridization experiment 
under, for example, stringent conditions as defined for that particular system. 

Similarly, in a particular embodiment, two amino acid sequences are 

25 "substantially homologous" or "substantially similar" when greater than 80% of the amino acids 
are identical, or greater than about 90% are similar (functionally identical). Preferably, the 
similar or homologous sequences are identified by alignment using, for example, the GCG 
(Genetics Computer Group, Program Manual for the GCG Package, Version 7, Madison, 
Wisconsin) pileup program, or any of the programs described above (BLAST, FASTA, etc) 

30 A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, such 

as a cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic acid molecule 
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can anneal to the other nucleic acid molecule under the appropriate conditions of temperature and 
solution ionic strength (see Sambrook et al, supra). The conditions of temperature and ionic 
strength determine the "stringency" of the hybridization. For preliminary screening for 
homologous nucleic acids, low stringency hybridization conditions, corresponding to a Tm 
(melting temperature) of 55 ^C, can be used, e.g., 5x SSC, 0,1% SDS, 0.25% milk, and no 
formamide; or 30% formamide, 5x SSC, 0.5% SDS. Moderate stringency hybridization 
conditions correspond to a higher Tm, e.g., 40% formamide, with 5x or 6x SSC. High stringency 
hybridization conditions correspond to the highest Tm, e.g. , 50% formamide, 5x or 6x SSC . SSC 
is a 0.15M NaCl, 0.0 15M Na-citrate buffer. Hybridization requires that the two nucleic acids 
contain complementary sequences, although depending on the stringency of the hybridization^ 
mismatches between bases are possible. The appropriate stringency for hybridizing nucleic acids 
depends on the length of the nucleic acids and the degree of complementation, variables well 
known in the art. The greater the degree of similarity or homology between two nucleotide 
sequences, the greater the value of Tm for hybrids of nucleic acids having those sequences. The 
relative stability (corresponding to higher Tm) of nucleic acid hybridizations decreases in the 
following order: RNArRNA, DNA:RNA, DNA:DNA. For hybrids of greater than 100 
nucleotides in length, equations for calculating Tm have been derived (see Sambrook et al, 
supra, 9.50-9.51). For hybridization with shorter nucleic acids, z.e., oligonucleotides, the 
position of mismatches becomes more important, and the length of the oligonucleotide 
determines its specificity (see Sambrook et al, supra, 11.7-11.8). A mimmum length for a 
hybridizable nucleic acid is at least about 1 0 nucleotides; preferably at least about 1 5 nucleotides; 
and more preferably the length is at least about 20 nucleotides. 

In a specific embodiment, the term "standard hybridization conditions" refers to 
a Tm of 55 ""C, and utilizes conditions as set forth above. In a preferred embodiment, the Tm is 
60 ''C; in a more preferred embodiment, the Tm is 65 °C. In a specific embodiment, "high 
stringency" refers to hybridization and/or washing conditions at 68° C in 0.2xSSC, at 42°C in 
50% formamide, 4xSSC, or under conditions that afford levels of hybridization equivalent to 
those observed under either of these two conditions. 

As used herein, the term "oligonucleotide" refers to a nucleic acid, generally of 
at least 10, preferably at least 15, and more preferably at least 20 nucleotides, preferably no more 
than 1 00 nucleotides, that is hybridizable to a genomic DNA molecule, a cDNA molecvde, or an 
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mRNA molecule encoding a gene, mRNA, cDNA, or other nucleic acid of interest. 
Oligonucleotides can be labeled, e,g., with ^^P-nucleotides or nucleotides to wliich a label, such 
as biotin, has been covalently conjugated. In one embodiment, a labeled oligonucleotide can be 
used as a probe to detect the presence of a nucleic acid. In another embodiment, oligonucleotides 
5 (one or both of which may be labeled) can be used as PGR primers, either for cloning full length 
or a fragment of H4, or to detect the presence of nucleic acids encoding H4. In a further 
embodiment, an oligonucleotide of the invention can form a triple helix with a H4 DNA 
molecule. Generally, oligonucleotides are prepared synthetically, preferably on a nucleic acid 
synthesizer. Accordingly, oligonucleotides can be prepared with non-naturally occurring 

1 0 phosphoester analog bonds, such as thioester bonds, etc. 

The present invention provides antisense nucleic acids (including ribozymes), 
which may be used to inhibit expression of H4 of the invention. Inhibition of H4 expression may 
be desired when upregulation of H4 receptor expression or excessive inhibition of cAMP 
formation induces disease states such as, transplant organ rejection; asthma; allergies; 

15 autoinmiune pathologies, such as, but not limited to, multiple sclerosis, type I diabetes, and 
rheumatoid arthritis; and CNS functions such as cognitive and memory defects. An "antisense 
nucleic acid" is a single stranded nucleic acid molecule which, on hybridizing under cytoplasmic 
conditions with complementary bases in an RNA or DNA molecule, inhibits the latter's role. If 
the RNA is a messenger RNA transcript, the antisense nucleic acid is a coiintertranscript or 

20 mRN A-interfering complementary nucleic acid. As presently used, "antisense" broadly includes 
RNA-RNA interactions, RNA-DNA interactions, ribozymes and RNase-H mediated arrest. 
Antisense nucleic acid molecules can be encoded by a recombinant gene for expression in a cell 
{e.g., U.S. Patent No. 5,814,500; U.S. Patent No. 5,811,234), or alternatively they can be 
prepared synthetically (e.g., U.S. Patent No. 5,780,607). 

25 Specific non-limiting examples of synthetic oligonucleotides envisioned for this 

invention include oligonucleotides that contain phosphorothioates, phosphotriesters, methyl 
phosphonates, short chain sdkyl, or cycloalkyl intersugar linkages or short chain heteroatomic or 
heterocyclic intersugar linkages. Most preferred are those with CHj-NH-O-CHj, CH2-N(CH3)-0- 
CH2, CH2-0-N(CH3)-CH2, CH2-N(CH3)-N(CH3)-CH2 and 0-N(CH3)-CH2-CH2 backbones (where 

30 phosphodiester is O-PO2-O-CH2). U.S. Patent No. 5,677,437 describes heteroaromatic 
olignucleoside linkages. Nitrogen linkers or groups containing nitrogen can also be used to 
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prepare oligonucleotide mimics (U.S. Patents No. 5,792,844 and No. 5,783,682). U.S. Patent 
No. 5,637,684 describes phosphoramidate and phosphorothioamidate oligomeric compounds. 
Also envisioned are oligonucleotides having morpholino backbone structures (U.S. Patent No. 
5,034,506). In other embodiments, such as the peptide-nucleic acid (PNA) backbone, the 
5 phosphodiester backbone of the oligonucleotide may be replaced with a poiyamide backbone, 
the bases being boimd directly or indirectly to the aza nitrogen atoms of the poiyamide backbone 
(Nielsen et ai. Science 254:1497, 1991). Other synthetic oligonucleotides may contain 
substituted sugar moieties comprising one of the following at tlie 2* position: OH, SH, SCH3, F, 
OCN, 0(CH2)„NH2 or 0(CH2)„CH3 where n is from 1 to about 10; CI to CIO lower alkyl, 

1 0 substituted lower alkyl, alkaryl or aralkyl; CI; Br; CN; CF3; OCF3; 0-; S-, or N-alkyl; 0-, S-, or 
N-alkenyl; SOCH3; SO2CH3; ONOj; NO^; N3; NH^; heterocycloalkyl; heterocycloalkaryl; 
aminoalkylammo; polyalkylamino; substitued silyl; a fluorescein moiety; an RNA cleaving 
group; a reporter group; an intercalator; a group for improving the pharmacokinetic properties 
of an oligonucleotide; or a group for improving the pharmacodynandc properties of an 

15 oligonucleotide, and other substituents having similar properties. Oligonucleotides may also 
have sugar mimetics such as cyclobutyls or other carbocyclics in place of the pentofuranosyl 
group. Nucleotide units having nucleosides other than adenosine, cytidine, guanpsine, thymidine 
and uridine, such as inosine, may be used in an oligonucleotide molecule. 

Nucleic Acids 

20 A gene encoding H4, whether genomic DNA or cDNA, can be isolated from any 

source, particularly from a hvunan cDNA or genomic library. Methods for obtaining H4 gene are 
well known in tiie art, as described above (see, e.g., Sambrook et al, 1989, supra). The DNA 
may be obtained by standard procedures known in the art from cloned DNA (e,g,, a DNA 
"library"), and preferably is obtained from a cDNA library prepared from tissues with high level 

25 expression of the protein, by chemical synthesis, by cDNA cloning, or by the cloning of genomic 
DNA, or fragments thereof, purified from the desired cell (See, for example, Sambrook et al, 
1989, supra; Glover, D.M. (ed.), 1985, DNA Cloning: A Practical Approach, MRL Press, Ltd., 
Oxford, U.K. Vol. I, II). Clones derived from genomic DNA may contain regulatory and intron 
DNA regions in addition to coding regions; clones derived from cDNA will not contain intron 

30 sequences. Whatever the source, the gene should be molecularly cloned into a suitable vector 
for propagation of the gene. Identification of the specific DNA fragment containing the desired 
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H4 gene may be accomplished in a number of ways. For example, a portion of a H4 gene 
exemplified infra can be purified and labeled to prepare a labeled probe, and the generated DNA 
may be screened by nucleic acid hybridization to the labeled probe (Benton and Davis, Science 
1977, 196:180; Grunstein and Hogness, Proc. Natl. Acad. Sci. U.S.A. 1975, 72:3961). Those 
5 DNA firagments with substantial homology to the probe, such as an allelic variant firom another 
individual, will hybridize. In a specific embodiment, highest stringency hybridization conditions 
are used to identify a homologous H4 gene. 

Further selection can be carried out on the basis of the properties of the gene, e.g., 
if the gene encodes a protein product having the isoelectric, electrophoretic, amino acid 
10 composition, partial or complete amino acid sequence, antibody binding activity, or ligand 
binding profile of H4 protein as disclosed herein. Thus, the presence of the gene may be detected 
by assays based on the physical, chemical, immunological, or functional properties of its 
expressed product. 

Other DNA sequences which encode substantially the same amino acid sequence 
15 as a H4 gene may be used in tiie practice of the present invention. These include but are not 
limited to allelic variants, species variants, sequence conservative variants, and functional 
vanants. 

Amino acid substitutions may also be introduced to substitute an amino acid v/ith 
a particularly preferable property. For example, a Cys may be introduced a potential site for 

20 disulfide bridges with another Cys. 

The genes encoding H4 derivatives and analogs of the invention can be produced 
by various methods known in the art. The manipulations which result in their production can 
occur at the gene or protein level. For example, the cloned H4 gene sequence can be modified 
by any of numerous strategies known in the art (Sambrook et aly 1989, supra). The sequence 

25 can be cleaved at appropriate sites with restriction endonuclease(s), followed by further 
enzymatic modification if desired, isolated, and ligated in vitro. In the production of the gene 
encoding a derivative or analog of H4, care should be taken to ensure that the modified gene 
remains within the same translational reading frame as the H4 gene, uninterrupted by translational 
stop signals, in the gene region where the desired activity is encoded. 

30 Additionally, the v-encoding nucleic acid sequence can be mutated in vitro or in 

v/vo, to create and/or destroy translation, initiation, and/or termination sequences, or to create 
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variations in coding regions and/or form new restriction endonuclease sites or destroy preexisting 
ones, to facilitate further in vitro modification. Such modifications can be made to introduce 
restriction sites and facilitate cloning the H4 gene into an expression vector. Any technique for 
mutagenesis known in the art can be used, including but not limited to, in vitro site-directed 
5 mutagenesis (Hutchinson, C, et al, J. Biol. Chem. 253:6551, 1978; ZoUer and Smith, DNA 
3:479-488, 1984; Oliphant et al. Gene 44:177, 1986; Hutchinson et al, Proc. Natl. Acad. Sci. 
U.S.A. 83:710, 1986), use of TAB*" linkers (Pharmacia), etc. PGR techniques are preferred for 
site directed mutagenesis (see Higuchi, 1989, "Using PGR to Engineer DNA", in PGR 
Technology: Principles and Applications for DNA Amphfication, H. Erlich, ed., Stockton Press, 

10 Chapter 6, pp. 61-70). 

The identified and isolated gene can then be inserted into an appropriate cloning 
vector. A large number of vector-host systems known in the art may be used. Possible vectors 
include, but are not limited to, plasmids or modified viruses, but the vector system must be 
compatible with the host cell used. Examples of vectors include, but are not limited to, E. coli, 

15 bacteriophages such as lambda derivatives, or plasmids such as Bluescript, pBR322 derivatives 
or pUG plasmid derivatives, e,g,, pGEX vectors, pmal-c, pFLAG, etc. The insertion into a 
cloning vector can, for example, be accomplished by ligating the DNA fragment into a cloning 
vector which has complementaiy cohesive termini. However, if the complementary restriction 
sites used to fragment the DNA are not present in the cloning vector, the ends of the DNA 

20 molecules may be enzymatically modified. Alternatively, any site desired may be produced by 
ligating nucleotide sequences (linkers) onto the DNA termini; these ligated linkers may comprise 
specific chemically synthesized oligonucleotides encoding restriction endonuclease recognition 
sequences, hi addition, simple PGR or overlapping PGR may be used to insert a fragment into 
a cloning vector. 

25 Recombinant molecules can be introduced into host cells via transformation, 

transfection, infection, electroporation, etc., so that many copies of the gene sequence are 
generated. Preferably, the cloned gene is contained on a shuttle vector plasmid, which provides 
for expansion in a cloning cell, e.g., E. coli, and facile purification for subsequent insertion into 
an appropriate expression cell line, if such is desired. For example, a shutde vector, which is a 

30 vector that can replicate in more than one type of organism, can be prepared for replication in 
both E. coli and Saccharomyces cerevisiae by linking sequences from an E. coli plasmid with 
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sequences form the yeast 2/^ plasmid. 

H4 Regulatory Nucleic Acids 
Elements of the H4 promoter can be identified by scanning the human genomic 
region upstream of the H4 start site, e,g., by creating deletion mutants and checking for 
5 expression, or with the TRANSFAC algorithm. Sequences up to about 6 kilobases (kb) or more 
upstream from the H4 start site can contain tissue-specific regulatory elements. 

The term "H4 promoter" encompasses artificial promoters. Such promoters can 
be prepared by deleting nonessential intervening sequences from tlie upstream region of the H4 
promoter, or by joining upstream regulatory elements from the H4 promoter with a heterologous 
10 minimal promoter, such as the CMV immediate early promoter. 

An H4 promoter can be operably associated with a heterogenous coding sequence, 
e.g., for reporter gene (luciferase and green fluorescent proteins are examples of reporter genes) 
in a constmct. This construct will result in expression of the heterologous coding sequence under 
control the H4 promoter, e.g., a reporter gene can be expressed, under conditions that imder 
1 5 normal conditions cause H4 expression. This construct can be used in screening assays, described 
below, for H4 agonists and antagonists. 

Expression of Polypeptides 
The nucleotide sequence coding for H4, or antigenic fragment, derivative or analog 
thereof, or a fimctionally active derivative, including a chimeric protein, thereof, can be inserted 
20 into an appropriate expression vector, z. e., a vector which contains the necessary elements for the 
transcription and translation of the inserted protein-coding sequence. Thus, a nucleic acid 
encoding H4 of the invention can be operationally associated with a promoter in an egression 
vector of the invention. Both cDNA and genomic sequences can be cloned and expressed imder 
control of such regulatory sequences. Such vectors can be used to express functional or 
25 functionally inactivated H4 polypeptides. 

The necessary transcriptional and translational signals can be provided on a 
recombinant expression vector, or they may be supplied by the native gene encoding H4 and/or 
its flanking regions. 

Potential host-vector systems include but are not limited to mammalian cell 
30 systems transfected vsdth expression plasmids or infected with virus (e.g., vaccinia virus, 
adenovirus, adeno-associated virus, herpes virus, etc.); insect cell systems infected with virus 
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(e.g., baculovirus); microorganisms such as yeast containing yeast vectors; or bacteria 
transformed with bacteriophage, DNA, plasmid DNA, or cosmid DNA. The expression 
elements of vectors vary in their strengths and specificities. Depending on the host-vector system 
utilized, any one of a number of suitable transcription and translation elements may be used. 
5 Expression of H4 protein may be controlled by any promoter/enhancer element 

known in the art, but these regulatory elements must be functional in the host selected for 
expression. Promoters which may be used to control H4 gene expression include, but are not 
limited to, cytomegalovirus (CMV) promoter (U.S. Patent Nos. 5,385,839 and 5,168,062), the 
SV40 early promoter region (Benoist and Chambon, 1981, Nature 290:304-310), the promoter 

10 contained in the 3' long terminal repeat of Rous sarcoma virus (Yamamoto, et al. Cell 22:787- 
797, 1980), the herpes tiiymidine kinase promoter (Wagner et al, Proc. Natl. Acad. Sci. U.S.A. 
78:1441-1445, 1981), the regulatory sequences of the metallothionein gene (Brinster et al. 
Nature 296:39-42, 1982); prokaryotic expression vectors such as the p-lactamase promoter 
(Villa-Komarofif, etal, Proc. Natl. Acad. Sci. U.S.A. 75:3727-3731, 1978), or the tac promoter 

15 (DeBoer, et al, Proc, Natl. Acad. Sci. U.S.A. 80:21-25, 1983); see also "Useful proteins from 
recombinant bacteria" in Scientific American, 242:74-94, 1980; promoter elements from yeast 
or other fungi such as the Gal 4 promoter, the ADC (alcohol dehydrogenase) promoter, PGK 
(phosphoglycerol kinase) promoter, alkaline phosphatase promoter; and transcriptional control 
regions that exhibit tissue specificity, particularly endothelial cell-specific promoters. 

20 Solubilized forms of the protein can be obtained by solubilizing inclusion bodies 

or reconstituting membrane components, e.g., by treatment with detergent, and if desired 
sonication or other mechanical processes, as described above. The solubilized protein can be 
isolated using various techniques, such as polyacrylamide gel electrophoresis (PAGE), isoelectric 
focusing, 2-dimensional gel electrophoresis, chromatography (e.g., ion exchange, afBnity, 

25 immunoaffinity, and sizing column chromatography), centrifugation, differential solubility, 
unmunoprecipitation, or by any other standard technique for the purification of proteins. 

Vectors 

A wide variety of host/expression vector combinations may be employed in 
expressing the DNA sequences of this invention. Useful expression vectors, for example, may 
30 consist of segments of chromosomal, non-chromosomal and synthetic DNA sequences. Suitable 
vectors include derivatives of SV40 and known bacterial plasmids, e.g., E. coli plasmids col El, 
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pCRl, pBR322, pMal-C2, pET, pGEX (Smith et aL, Gene 67:31-40, 1988), pMB9 and their 
derivatives, plasmids such as RP4; phage DNAS, e,g,, the numerous derivatives of phage 1, e.g., 
NM989, and other phage DNA, e.g., Ml 3 and filamentous single stranded phage DNA; yeast 
plasmids such as the l/j, plasmid or derivatives thereof; vectors useful in exikaryotic cells, such 
5 as vectors useful in insect or mammalian cells; vectors derived firom combinations of plasmids 
and phage DNAs, such as plasmids that have been modified to employ phage DNA or other 
expression control sequences; and the like. 

Viral vectors, such as lentiviruses, retroviruses, herpes viruses, adenoviruses, 
adeno-associated viruses, vaccinia virus, baculovirus, alphavirus, and other recombinant viruses 

10 with desirable cellular tropism are also useful. Thus, a gene encoding a functional or mutant H4 
protein or polypeptide domain fragment thereof can be introduced in vzvo, ex v/vo, or in viti'O 
using a viral vector or through direct introduction of DNA. Expression in targeted tissues can 
be effected by targeting the transgenic vector to specific cells, such as with a viral vector or a 
receptor ligand, or by using a tissue-specific promoter, or both. Targeted gene delivery is 

15 described in International Patent Publication WO 95/28494, published October 1995. 

Viral vectors commonly used for in vivo or ex vivo targeting and therapy 
procedures are DNA-based vectors and retroviral vectors. Methods for constructing and using 
viral vectors are known in the art (see, e.g.. Miller and Rosman, BioTechniques 1992, 7:980- 
990). Preferably, the viral vectors are replication defective, that is, they are unable to replicate 

20 autonomously in the target cell. In general, the genome of the replication defective viral vectors 
which are used within the scope of the present invention lack at least one region which is 
necessary for the replication of the viras in the infected cell. These regions can either be 
eliminated (in whole or in part) or be rendered non-functional by any technique known to a 
person skilled in the art. These techniques include the total removal, substitution (by other 

25 sequences, in particular by the inserted nucleic acid), partial deletion or addition of one or more 
bases to an essential (for replication) region. Such techniques may be performed in vitro (on the 
isolated DNA) or m situ, using the techniques of genetic manipulation or by treatment with 
mutagenic agents. Preferably, the replication defective virus retains the sequences of its genome 
which are necessary for encapsidating the viral particles. 

30 DNA viral vectors include an attenuated or defective DNA virus, such as but not 

limited to herpes simplex vims (HSV), papillomavirus, Epstein Barr viras (EBV), adenoviras. 
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adeno-associated vims (AAV), and the like. Defective viruses, which entirely or almost entirely 
lack viral genes, are preferred. Defective virus is not infective after introduction into a cell. Use 
of defective viral vectors allowrs for administration to cells in a specific, localized area, without 
concern that the vector can infect other cells. Thus, a specific tissue can be specifically targeted. 
5 Examples of particular vectors include, but are not limited to, a defective herpes virus 1 (HSV 1) 
vector (Kaplitt et al., Molec. Cell. Neurosci. 2:320-330, 1991), defective herpes virus vector 
lacking a glyco-protein L gene (Patent Publication RD 3 7 1 005 A), or other defective herpes virus 
vectors (International Patent Publication No. WO 94/21807, published September 29, 1994; 
International Patent Publication No. WO 92/05263, published April 2, 1994); an attenuated 

10 adenovirus vector, such as the vector described by Stratford-Perricaudet et al (J. Clin. Invest. 
90:626-630, 1992; see also La Salle etaly Science 259:988-990, 1993); and a defective adeno- 
associated virus vector (Samulski et al , J. Virol. 61 :3096-3 101,1 987; Samulski et al , J. Virol. 
63:3822-3828, 1989; Lebkowski et al, MoL Cell. Biol. 8:3988-3996, 1988). 

Various companies produce viral vectors commercially, including but by no 

15 means limited to Avigen, Inc. (Alameda, CA; AAV vectors). Cell Genesys (Foster City, CA; 
retroviral, adenoviral, AAV vectors, and lentiviral vectors), Clontech (retroviral and baculoviral 
vectors), Genovo, Inc. (Sharon Hill, PA; adenoviral and AAV vectors), Genvec (adenoviral 
vectors), IntroGene (Leiden, Netherlands; adenoviral vectors), Molecular Medicine (retroviral, 
adenoviral, AAV, and herpes viral vectors), Norgen (adenoviral vectors), Oxford BioMedica 

20 (Oxford, United Kingdom; lentiviral vectors), and Transgene (Strasbourg, France; adenoviral, 
vaccinia, retroviral, and lentiviral vectors). 

Preferably, for in vivo administration, an appropriate immunosuppressive 
treatment is employed in conjunction with the viral vector, e.g., adenovirus vector, to avoid 
immuno-deactivation of the viral vector and transfected cells. For example, immunosuppressive 

25 cytokines, such as interleukin-12 (IL-12), interferon-y (IFN-y), or anti-CD4 antibody, can be 
provided to block humoral or cellular immune responses to the viral vectors (see, e,g,, Wilson, 
Nature Medicine, 1995). In that regard, it is advantageous to employ a viral vector that is 
engineered to express a minimal number of antigens. 

In another embodiment, the vector can be introduced in vivo by lipofection, as 

30 naked DNA, or with other transfection facilitating agents (peptides, polymers, etc.). Synthetic 
cationic lipids can be used to prepare liposomes for in vivo transfection of a gene encoding a 
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marker (Feigner, et. al., Proc. Natl. Acad. Sci. U.S.A. 84:7413-7417, 1987; Feigner and Ringold, 
Science 337:387-388, 1989; see Mackey, et al, Proc. Natl. Acad. Sci. U.S.A. 85:8027-8031, 
1988; Ulmer, et al. Science 259:1745-1748, 1993). Useful lipid compoiinds and compositions 
for transfer of nucleic acids are described in International Patent Publications WO 95/1 8863 and 
5 WO 96/17823, and in U.S. Patent No. 5,459,127. Lipids may be chemically coupled to other 
molecules for the purpose of targeting (see Mackey, et al, supra). Targeted peptides, e.g., 
hormones or neurotransmitters, and proteins such as antibodies, or non-peptide molecules could 
be coupled to liposomes chemically. 

Other molecules are also useful for facilitating transfection of a nucleic acid in 

10 vivo, such as a cationic oligopeptide {e.g., International Patent Publication WO 95/21931), 
peptides derived from DNA binding proteins {e.g., International Patent Publication WO 
96/25508), or a cationic polymer (e.g.. International Patent Publication W095/21931). 

Altematively, non-viral DNA vectors for gene therapy can be introduced into the 
desired host cells by methods known in the art, e.g., electroporation, microinjection, cell fiision, 

1 5 DEAE dextran, calcium phosphate precipitation, use of a gene gun (ballistic transfection; see, 
e.g., U.S. Pat. No. 5,204,253, U.S. Pat. No. 5,853,663, U.S. Pat. No. 5,885,795, and U.S. Pat. No. 
5,702,384 and see Sanford, TIB-TECH, 6:299-302, 1988; Fynan et al, Proc. Natl. Acad. Sci. 
U.S.A., 90:1 1478-1 1482, 1993; and Yang et al, Proc. Natl. Acad Sci. U.S.A., 87:1568-9572, 
1990), or use of a DNA vector transporter (see, e.g., Wu, et al, J. Biol. Chem. 267:963-967, 

20 1992; Wu and Wu, J. Biol. Chem. 263:14621-14624, 1988; Hartmut, et al, Canadian Patent 
Application No. 2,012,31 1, filed March 15, 1990; Williams, etal, Proc. Natl. Acad. Sci. USA 
88:2726-2730, 1991). Receptor-mediated DNA delivery approaches can also be used (Curiel, 
et al. Hum. Gene Ther. 3:147-154, 1992; Wu and Wu, J. Biol. Chem. 262:4429-4432, 1987). 
U.S. Patent Nos. 5,580,859 and 5,589,466 disclose delivery of exogenous DNA sequences, free 

25 of transfection facilitating agents, in a mammal. Recently, a relatively low voltage, high 
efficiency in vivo DNA transfer technique, termed electrotransfer, has been described (Mir, et al , 
CP. Acad. Sci., 321:893, 1998; WO 99/01157; WO 99/01158; WO 99/01175). 

H4 Ligands and Binding Partiaers 
30 The present invention further permits identification of physiological ligands and 

binding partners of H4. One method for evaluating and identifying H4 binding partners is the 
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yeast two-hybrid screen. Preferably, the yeast two-hybrid screen is performed using an cell 
library with yeast that are transformed with recombinant H4. Aitematively, H4 can be used as a 
capture or affinity purification reagent, hi another alternative, labeled H4 can be used as a probe 
for binding, e.g., by immunoprecipitation or Western analysis. Expected H4 binding partners are 
5 G-proteins. 

Generally, binding interactions between H4 and any of its binding partners will 
be strongest under conditions approximating those found in the cytoplasm, ie,, physiological 
conditions of ionic strength, pH and temperature. Perturbation of these conditions will tend to 
disrupt the stability of a binding interaction. 

10 

Antibodies to 

Antibodies to H4 are useful, inter alia^ for diagnostics and intracellular regulation 
of H4 activity, as set forth below. According to the invention, a H4 polypeptide produced 
recombinantly or by chemical synthesis, and fragments or other derivatives or analogs thereof, 

1 5 including fusion proteins, may be used as immunogens to generate antibodies that recognize the 
H4 polypeptide. Such antibodies include but are not limited to polyclonal, monoclonal, chimeric, 
single chain. Fab fragments, and an Fab expression library. Such an antibody is preferfably 
specific for human H4 and it may recognize either a mutant form of H4 or wild-type H4, or both. 

One can use the hydropathic index of amino acids, as discussed by Kyte and 

20 Doolittle(JMolBioL 1982, 157:105-132), See, forexample, U.S. Patent 4,554,101, whichstates 
that the greatest local average hydrophilicity of a "protein," as governed by the hydrophilicity of 
its adjacent amino acids, correlates with its immunogenicity. Accordingly, it is noted that 
substitutions can be made based on the hydrophilicity assigned to each amino acid. In using 
either the hydrophilicity index or hydropathic index, which assigns values to each amino acid, 

25 it is preferred to introduce substitutions of amino acids where these values are ± 2, with ± 1 being 
particularly preferred, and those within ±0.5 being the most preferred substitutions. 

Various procedures known in the art may be used for the production of polyclonal 
antibodies to H4 polypeptide or derivative or analog thereof. For the production of antibody, 
various host animals can be immunized by injection with the H4 polypeptide, or a derivative (e.g., 

30 fragment or fusion protein) thereof, including but not limited to rabbits, mice, rats, sheep, goats, 
etc. In one embodiment, the H4 polypeptide or fragment thereof can be conjugated to an 
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immunogenic carrier, e.g,, bovine serum albumin (BSA) or keyhole limpet hemocyanin (KLH). 
Various adjuvants may be used to increase the immunological response, depending on the host 
species, including but not limited to Freund's (complete and incomplete), mineral gels such as 
aluminum hydroxide, surface active substances such as ly solecithin, pluronic polyols, polyanions, 
5 peptides, oil emulsions, keyhole limpet hemocyanins, dinitrophenol, and potentially useful 
human adjuvants such as BCG (bacille Calmette-Guerin) and Corynebacterium parvum. 

For preparation of monoclonal antibodies directed toward the H4 polypeptide, or 
fragment, analog, or derivative thereof, any technique that provides for the production of 
antibody molecules by continuous cell lines in culture may be used. These include but are not 

10 limited to the hybridoma technique originally developed by Kohler and Milstein (Nature 1975, 
256:495-497), as well as the trioma technique, the human B-cell hybridoma technique (Kozbor 
et aL, Immunology Today 1983, 4:72; Cote etaL, Proc. Natl. Acad. Sci. 1983, 80:2026-2030), 
. and the EBV-hybridoma technique to produce hioman monoclonal antibodies (Cole et al^ in 
Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., 1985, pp. 77-96). In an 

15 additional embodiment of the invention, monoclonal antibodies can be produced in germ-free 
animals (International Patent Publication No. WO 89/1 2690). In fact, according to the invention, 
techniques developed for the production of "chimeric antibodies" (Morrison etal, J. Bacteriol. 
1984, 159:870; Neuberger et al. Nature 1984, 312:604-608; Takeda et al. Nature 1985, 
314:452-454) by splicing the genes from a mouse antibody molecule specific for an H4 

20 polypeptide together wdth genes from a human antibody molecule of appropriate biological 
activity can be used; such antibodies are within the scope of this invention. Such human or 
humanized chimeric antibodies are preferred for use in therapy of human diseases or disorders 
(described infra\ since the human or humanized antibodies are much less likely than xenogenic 
antibodies to induce an immune response, in particular an allergic response, themselves. 

25 Antibody fragments which contain ttie idiotype of the antibody molecule can be 

generated by known techniques. For example, such fragments include but are not limited to: the 
F(ab')2 fragment which can be produced by pepsin digestion of the antibody molecule; the Fab' 
fragments which can be generated by reducing the disulfide bridges of the F(ab')2 fragment, and 
the Fab fragments which can be generated by treating the antibody molecule with papain and a 

30 reducing agent. 

According to the invention, techniques described for the production of single 
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chain antibodies (U.S. Patent Nos. 5,476,786, 5,132,405, and U.S. Patent 4,946,778) can be 
adapted to produce H4 polypeptide-specific single chain antibodies. An additional embodiment 
of the invention utilizes the techniques described for the construction of Fab expression libraries 
(Huse etal. Science 1 989, 246:1275-1281) to allow rapid and easy identification of monoclonal 
5 Fab fragments with the desired specificity for an H4 polypeptide, or its derivatives, or analogs. 

In the production and use of antibodies, screening for or testing with the desired 
antibody can be accomplished by techniques knovm in the art, e.g., radioimmunoassay, ELISA 
(en2yme-linked immunosorbant assay), "sandwich" immunoassays, immimoradiometric assays, 
gel diffusion precipitin reactions, immunodiffusion assays, in situ immunoassays (using colloidal 

10 gold, enzyme or radioisotope labels, for example), Western blots, precipitation reactions, 
agglutination assays {e.g., gel agglutination assays, hemagglutination assays), complement 
fixation assays, immunofluorescence assays, protein A assays, and immxmoelectrophoresis 
assays, etc. In one embodiment, antibody binding is detected by detecting a label on the primary 
antibody. In another embodiment, the primary antibody is detected by detecting binding of a 

15 secondary antibody or reagent to the primary antibody. In a further embodiment, the secondary 
antibody is labeled. Many means are known in the art for detecting binding in an immunoassay 
and are within the scope of the present invention. For example, to select antibodies which 
recognize a specific epitope of an H4 polypeptide, one may assay generated hybridomas for a 
product which binds to an H4 polypeptide fragment containing such epitope. For selection of an 

20 antibody specific to an H4 polypeptide Jfrom a particular species of animal, one can select on the 
basis of positive binding with H4 polypeptide expressed by or isolated from cells of that species 
of animal. 

The foregoing antibodies can be used in methods known in the art relating to the 
localization and activity of the H4 polypeptide, e.g,, for Western blotting, imaging H4 polypeptide 

25 in situ, measuring levels thereof in appropriate physiological samples, etc. using any of the 
detection techniques mentioned above or known in the art. Such antibodies can also be used in 
assays for ligand binding, e.g., as described in U.S. Patent No. 5,679,582. Antibody binding 
generally occurs most readily under physiological conditions, e.g., pH of between about 7 and 
8, and physiological ionic strength. The presence of a carrier protein in the buffer solutions 

30 stabilizes the assays. While there is some tolerance of perturbation of optimal conditions, e.g., 
increasing or decreasing ionic strength, temperature, or pH, or adding detergents or chaotropic 
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salts, such perturbations will decrease binding stability. 

In a specific embodiment, antibodies that act as ligands and agonize or antagonize 
the activity of H4 polypeptide can be generated. In addition, intracellular single chain Fv 
antibodies can be used to regulate cAMP formation (Marasco et al., Proc. Natl. Acad. Sci. U.S.A. 
5 1993, 90:7884-7893; Chen., Mol. Med. Today 1997, 3:160-167; Spitz et al.. Anticancer Res, 
1996, 16:3415-22; Indolfi et al., Nat. Med. 1996, 2:634-635; Kijma et al., Pharaiacol. Ther. 
1 995, 68:247-267). Such antibodies can be tested using the assays described infra for identifying 
ligands. 



10 Screening and ChemiHstrv 

According to the present invention, nucleotide sequences encoding H4 and the H4 
receptor structure, which can be modeled from the amino acid sequence based on homology to 
other GPCR proteins, are useful targets to identify drugs that are effective in treating disorders 
associated with histamine-regulated processes. Drug targets include without limitation (i) 

1 5 isolated nucleic acids derived from the gene encoding H4 (e.g., antisense or ribozyme molecules) 
and (ii)small molecule compounds that recognize and bind the receptor. 

In particular, identification and isolation of H4 provides for development of 
screening assays, particularly for high throughput screening of molecules that up- or down- 
regulate the activity of H4. Accordingly, the present invention contemplates methods for 

20 identifying specific histamine receptor ligands that interact with H4 receptors, using various 
screening assays known in the art. 

Any screening technique known in the art can be used to screen for H4 agonists 
or antagonists. The present invention contemplates screens for small molecule ligands or ligand 
analogs and mimics, as well as screens for natural ligands that bind to and agonize or antagonize 

25 H4 activity in vivo. For example, natural products libraries can be screened using assays of the 
invention for molecules that agonize or antagonize H4 expression or activity. 

Another approach uses recombinant bacteriophage to produce large libraries. 
Using the "phage method" (Scott and Smith, Science 1990, 249:386-390; Cwirla, et al, Proc. 
Natl. Acad. Sci., USA 1990, 87:6378-6382; Devlin a/.. Science 1990,49:404-406), very large 

30 libraries can be constructed (106-108 chemical entities). A second approach uses primarily 
chemical methods, of which the Geysen method (Geysen et al. Molecular Immunology 1986, 
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23:709-715; Geysene^a/. J. Immunologic Method 1987 102:259-274; and the method of Fodor 
etai (Science 1991, 251:767-773) are examples. Furka etaL (14th International Congress of 
Biochemistry, Volume #5 1988, Abstract FR:013; Furka, Int. J. Peptide Protein Res. 1991, 
37:487-493), Houghton (U.S. Patent No. 4,631,211) and Rutter (U.S. Patent No. 5,010,175) 
5 describe methods to produce a mixture of peptides that can be tested as agonists or antagonists. 

In another aspect, synthetic libraries (Needels et ah , Proc. Natl. Acad. Sci. USA 
1993, 90:10700-4; Ohlmeyer et al, Proc. Natl. Acad. Sci. USA 1993, 90:10922-10926; Lam 
et al , PCX Publication No. WO 92/00252; Kocis et al , PCX Publication No. WO 9428028) and 
the like can be used to screen for ligands that regulate H4 activity. Xest compounds are screened 

10 from large libraries of synthetic or natural compounds. Numerous means are currently used for 
random and directed synthesis of saccharide, peptide, and nucleic acid based compounds. 
Synthetic compound libraries are commercially available from Maybridge Chemical Co. 
(Xrevillet, Cornwall, UK), Comgenex (Princeton, NJ), Brandon Associates (Merrimack, NH), 
and Microsource (New Milford, CX). A rare chemical library is available from Aldrich 

1 5 (Milwaukee, WI). Alternatively, libraries of natural compounds in the form of bacterial, fungal, 
plant and animal extracts are available from e.g. Pan Laboratories (Bothell, WA) or MycoSearch 
(NC), or are readily producible. Additionally, natural and synthetically produced libraries and 
compounds are readily modified through conventional chemical, physical, and biochemical 
means (Blondelle et al , Xib Xech 1 996, 14:60). 

20 Knowledge of the primary sequence of H4, and the similarity of that sequence with 

proteins of known ftmction, can provide an initial clue as to the structure of agonists or 
antagonists of the receptor. Identification and screening of agonists antagonists is fiirther 
facilitated by determining structural features of the receptor, e.g., using X-my crystallography, 
neutron diffraction, nuclear magnetic resonance spectrometry, homology studies, structure- 

25 activity relationships, and other techniques for stmcture determination. Xhese techniques provide 
for the rational design or identification of agonists and antagonists. 

One technique that may be used to assess the affinity of a test compound for the 
H4 receptor is a competition binding assay. In this assay, test wells containing an aliquot of a 
lipid bilayer membranes that contain the histamine H4 receptor are incubated with an known 

3 0 concentration of a radiolabeled ligand for the receptor. Xhe lipid bilayer may be prepared by any 
known protocol that separates the membrane containing receptor component from the 
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cytoplasmic components. Each well also is incubated with a different concentration of a 
unlabeled test compound. Cell membranes are then separated from the incubation mixture by 
any method known in the art including, but not limited to, centrifugation and vacuum filtration 
on a cell harvester. The radioactivity of each well is then determined using any device that can 
5 detect radioactivity, such as a scintillation counter. As increasing concentrations of the test 
compound compete for the receptor binding site, the radioactivity detected decreases. The data 
then can be converted using the Cheng-Prusoff equation (Biochem Pharmacol. 1973, 
22:3099-3 1 08) to determine the affinity (K,) of the compound for the receptor. 

In vivo screening methods 

10 Intact cells or whole animals expressing a gene encoding H4 can be used in 

screening methods to identify candidate drugs. 

In one series of embodiments, a permanent cell line is established. Alternatively, 
cells (including without limitation manmialian, insect, yeast, or bacterial cells) are transiently 
progranraied to express an H4 gene by introduction of appropriate DN A or mRNA. Identification 

15 of candidate compounds can be achieved using any suitable assay, including without limitation 
(i) assays that measure binding of test compounds to H4 (ii) assays that measure the ability of a 
test compound to modify (z. e. , inhibit or enhance) a measurable activity or function of H4 and (iii) 
assays that measure the ability of a compound to modify (z.e., inhibit or enhance) the 
transcriptional activity of sequences derived from the promoter (/.e., regulatory) regions of the 

20 H4 gene. 

H4 knockout mammals can be prepared for evaluating the molecular pathology 
of this defect in greater detail than is possible with human subjects. Such animals also provide 
excellent models for screening drug candidates. A "knockout mammal" is an mammal (e.g., 
mouse, rabbit) that contains within its genome a specific gene that has been inactivated by the 

25 method of gene targeting (see, e.g., U.S. Patent Nos. 5,777,195 and 5,616,491). A knockout 
mammal includes both a heterozygote knockout {i.e., one defective allele and one wild-type 
allele) and a homozygous mutant {i,e„ two defective alleles; a heterologous construct for 
expression of an H4, such as a human H4, could be inserted to permit the knockout mammal to 
live if lack of H4 expression was lethal). Preparation of a knockout mammal requires first 

30 introducing a nucleic acid construct that will be used to suppress expression of a particular gene 
into an undifferentiated cell type termed an embryonic stem cell. This cell is then injected into 



BNSDCXJID: <WO_0iaS7B6A3JA> 



wo 01/085786 



PCT/USOl/14527 



-34- 

a mammalian embryo. A mammalian embryo with an integrated cell is then implanted into a 
foster mother for the duration of gestation, Zhou, et al (Genes and Development 1995, 
9:2623-34) describes PPCA knock-out mice. 

The term "knockout" refers to partial or complete suppression of the expression 
5 of at least a portion of a protein encoded by an endogenous DNA sequence in a celL The term 
"knockout construct" refers to a nucleic acid sequence that is designed to decrease or suppress 
expression of a protein encoded by endogenous DNA sequences in a cell. The nucleic acid 
sequence used as the knockout construct is typically comprised of (1) DNA from some portion 
of the gene (exon sequence, intron sequence, and/or promoter sequence) to be suppressed and 

10 (2) a marker sequence used to detect the presence of the knockout construct in the cell. The 
knockout construct is inserted into a cell, and integrates with the genomic DNA of the cell in 
such a position so as to prevent or interrupt transcription of the native DNA sequence. Such 
insertion usually occurs by homologous recombination (z.e., regions of the knockout construct 
that are homologous to endogenous DNA sequences hybridize to each other when the knockout 

15 constmct is inserted into the cell and recombine so that the knockout construct is incorporated 
into the corresponding position of the endogenous DNA). The knockout constmct nucleic acid 
sequence may comprise (1) a full or partial sequence of one or more exons and/or introns of the 
gene to be suppressed, (2) a full or partial promoter sequence of the gene to be suppressed, or (3) 
combinations thereof Typically, the knockout construct is inserted into an embryonic stem cell 

20 (ES cell) and is integrated into the ES cell genomic.DNA, usually by the process of homologous 
recombination. This ES cell is then injected into, and integrates with, the developing embryo. 

The phrases "disraption of the gene" and "gene dismption" refer to insertion of 
a nucleic acid sequence into one region of the native DNA sequence (usually one or more exons) 

25 and/or the promoter region of a gene so as to decrease or prevent expression of that gene in the 
cell as compared to the wild-type or naturally occurring sequence of the gene. By way of 
example, a nucleic acid constmct can be prepared containing a DNA sequence encoding an 
antibiotic resistance gene which is inserted into the DNA sequence that is complementary to the 
DNA sequence (promoter and/or coding region) to be dismpted. When this nucleic acid construct 

30 is then transfected into a cell, the constmct will integrate into the genomic DNA. Thus, many 
progeny of the cell will no longer express the gene at least in some cells, or will express it at a 
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decreased level, as the DNA is now disrupted by the antibiotic resistance gene. 

Generally, the DNA will be at least about 1 kb in length and preferably 3-4 kb in 
length, thereby providing sufficient complementary sequence for recombination when the 
knockout construct is introduced into the genomic DNA of the ES cell (discussed below). 
5 Included within the scope of this invention is a mammal in which two or more 

genes have been knocked out. Such mammals can be generated by repeating the procedures set 
forth herein for generating each knockout construct, or by breeding to mammals, each vsdth a 
single gene knocked out, to each other, and screening for those vnth the double knockout 
genotjpe. 

10 Regulated knockout animals can be prepared using various systems, such as the 

tet-repressor system (see U.S. Patent No. 5,654, 1 68) or the Cre-Lox system (see U.S. Patent Nos. 

4,959,317 and 5,801,030). 

Li another series of embodiments, transgenic animals are created in which (i) a 

human H4 is stably inserted into the genome of the transgenic animal; and/or (ii) the endogenous 
15 H4 genes are inactivated and replaced with human H4 genes. See, e.g., CofEhian, Semin. Nephrol. 

1997, 17:404; Esther et al. Lab. Invest. 1996, 74:953; Murakami et al.. Blood Press. Suppl. 

1996, 2:36. 

H4 Activation Assay 
Any cell assay system that allows for assessment of functional activity of H4 

20 agonists and antagonists is defined by the present invention. In a specific embodiment, 
exemplified infra^ the assay can be used to identify compounds that selectively interact with H4, 
which can be evaluated by assessing the effects of H4 transformed cells contacted with a test 
compovmd, which modulates cAMP accumulation. The assay system can tiaus be used to identify 
compounds tliat selectively produce a functional effect through histamine H4 receptors. 

25 Compoxmds that increase cAMP formation and accumulation may be useful as novel therapeutics 
in the prevention of transplanted organ rejection, asthma, allergies, and autoimmune pathologies, 
such as, but not limited to, multiple sclerosis, type I diabetes, and rheumatoid arthritis. 
Additionally, CNS functions such as cognitive and memory defects may be treated. Preferably, 
each experiment is performed in triplicate at multiple different dilutions of test compound. 

30 An agonist and/or antagonist screen involves detecting cAMP accumulation by 

the host cell when contacted wdth H4 ligand. If cAMP accumulation is increased, the test 
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compound is a candidate antagonist of H4 receptors. If cAMP accumulation is decreased, the test 
compound is a candidate agonist of H4 receptors. If there is no change in cAMP formation, the 
test compound is not an effective H4 ligand. 

Any convenient method permits detection of the formed product, cAMP, For 
5 example, the invention provides immunoassays for detecting cAMP. Typically, immunoassays 
xise either a labeled antibody or a labeled antigenic component (e.g., that competes with the 
antigen in the sample for binding to the antibody). Suitable labels include without limitation 
enzyme-based, fluorescent, chemiluminescent, radioactive, or dye molecules. Assays that 
amplify the signals j&om the probe are also known, such as, for example, those that utilize biotin 

10 and avidin, and enzyme-labelled immunoassays, such as ELISA assays. Alternatively, labeled 
antigenic component may be quantified by scintillation techniques. In another method, the 
second messenger, preferably cAMP, will be separated on a high performance liquid 
chromatograph and quantified by a UV detector. 

The assay system described here also may be used in a high-throughput primary 

15 screen for agonists and antagonists, or it may be used as a secondary fimctional screen for 
candidate compounds identified by a different primary screen, e.g., a binding assay screen that 
identifies compounds that interact with the receptor. 

High-Throughput Screen 
Agents according to the invention may be identified by screening in high- 

20 throughput assays, including without limitation cell-based or cell-firee assays. It will be 
appreciated by those skilled in the art that different types of assays can be used to detect different 
types of agents. Several methods of automated assays have been developed in recent years so 
as to permit screening of tens of thousands of compounds in a short period of time. Such high- 
throughput screening methods are particularly preferred. The use of high-throughput screening 

25 assays to test for agents is greatly facilitated by the availability of large amounts of purified 
polypeptides, as provided by the invention. 

Compounds 

"Histamine" refers to a neurotransmitter that is produced and released fi"om 
30 neurons. Histamine is formed fi-om the amino acid histidine by histidine decarboxylase. 

Structurally, histamine is an imidazolethylamine. In other words, histamine is comprised of 
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an imidazole moiety and an amino group connected by an alkyi chain. The 
pharmacologically active form of histamine is proposed to be the monocationic tautomer, 
where one of the nitrogen present in the imidazole ring is positively charged. However, 
different forms of histamine may interact with histamine receptors to produce a functional 
5 effect. Histamine is produced intracellularly and stored until released in response to a 
physiological stimulus. 

"Histamine analogs*' refers to compounds that comprise the imidazole, alkyl 
chain, and a nitrogen moiety (e.g., amine (unsubstituted or substituted), piperidine, pyridine) 
of histamine, but may be modified at other positions. These modifications may be performed 

10 to alter affinity and/or selectivity of the compotind for the histamine receptors. "Histamine 
compounds" refers to compounds that may bind to the histamine receptors. 

Histamine analogs and compounds can be classified as agonists or antagonists. 
As discussed previously, agonists are ligands that bind to the receptor and produce a 
functional effect similar to that produced by the endogenous ligand (i.e., histamine) for the 

1 5 receptor, whereas antagonists are ligands that bind to the receptor and block a functional 
effect produced by an agonist for the receptor or the endogenous ligand of the receptor. 
Histamine analogs and compounds are further described in Goodman and Oilman's The 
Pharmacological Basis of Therapeutics, Ninth Edition, McGraw-Hill, 1996. 

Agonists that may be contemplated by this invention include, but are not 

20 limited to, R-(a)-methylhistamine, imetit, and immepip. Antagonists, burimamide, 

impromidine, dimaprit, and thioperamide clobenpropit and iodophenpropit impentamine, 
GT2016 and iodoproxyfan. Other compounds include derivatives, metabolites, and 
precursors. 

25 Methods of Diagnosis 

According to the present invention, genetic variants of H4 can be detected to 
diagnose an H4 associated disease, such as treatment for transplanted organ rejection, asthma, 
allergies, and autoimmune pathologies, such as, but not limited to, multiple sclerosis, type I 
diabetes, and rheumatoid arthritis. Additionally, CNS functions such as cognitive and 

30 memory defects may be treated. The various methods for detecting such variants are 

described herein. Where such variants inipact H4 function, either as a result of a mutated 
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amino acid sequence or because the mutation results in expression of a truncated protein, or 
no expression at all, they are expected to result in disregulation of the allergic response, the 
immune response, cognition and memory. 

Nucleic Acid Assays 
The DNA may be obtained from any cell source. DNA is extracted from the 
cell source or body fluid using any of the numerous methods that are standard in the art. It 
will be understood that the particular method used to extract DNA will depend on the nature 
of the source. Generally, the minimum amount of DNA to be extracted for use in the present 
invention is about 25 pg (conresponding to about 5 cell equivalents of a genome size of 4 x 
10' base pairs). 

In another alternate embodiment, RNA is isolated from biopsy tissue using 
standard methods well known to those of ordinary skill in the art such as guanidium 
thiocyanate-phenol-chlorofoim extraction (Chomocyznski et al. Anal. Biochem., 162:156, 
1987). The isolated RNA is then subjected to coupled reverse transcription and amplification 
by polymerase chain reaction (RT-PCR), using specific oligonucleotide primers that are 
specific for a selected site. Conditions for primer aimealmg are chosen to ensure specific 
reverse transcription and amplification; thus, the appearance of an amplification product is 
diagnostic of the presence of a particular genetic variation. In another embodiment, RNA is 
reverse-transcribed and amplified, after which the amplified sequences are identified by, e.g.. 
durect sequencing. In stiU another embodunent, cDNA obtained from the RNA can be cloned 
and sequenced to identify a mutation. 

Protein Assays 

In an alternate embodiment, biopsy tissue is obtained from a subject. 
Antibodies that are capable of specifically binding to H4 are then contacted with samples of 
the tissue to determine the presence or absence of a H4 polypeptide specified by the antibody. 
The antibodies may be polyclonal or monoclonal, preferably monoclonal. Measurement of 
specific antibody binding to cells may be accomplished by any known method, e.g.. 
quantitative flow cytometry, enzyme-linked or fluorescence-linked immimoassay. Western 
analysis, etc. 
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Therapeutic Uses 

According to the present invention, stimulation of H4 receptor activity may be 
used as a treatment option in patients with histamine-related disease states. Stimulation of H4 
receptor activity may be by methods, such as, but not limited to, (i) providing polypeptides that 
5 stimulate receptor activity and (ii) providing compounds that stimulate receptor activity. 

Gene Therapy 

Li a specific embodiment, vectors comprising a sequence encoding a protein, 
including, but not limited to, full-length H4, are provided to treat or prevent a disease or disorder 
associated with the function of H4 in peripheral blood leukocytes. In this embodiment of the 
10 invention, the therapeutic vector encodes a sequence that produces the protein of the invention. 

Any of the methods for gene therapy available in the art can be used according to 
the present invention. Exemplary methods are described below. 

For general reviews of the methods of gene therapy, see, Goldspiel et al , Clinical 
Pharmacy, 1993, 12:488-505; Wu and Wu, Biotherapy, 1991, 3:87-95; Tolstoshev, Ann. Rev. 
15 Pharmacol. Toxicol., 1993, 32:573-596; Mulligan, Science, 1993, 260:926-932; and Morgan and 
Anderson, Ann. Rev. Biochem., 1993, 62:191-217; May, TIBTECH, 1993, 11:155-215. Methods 
commonly known in the art of recombinant DNA technology that can be used are described in 
Ausubel et al, (eds.), 1993, Current Protocols in Molecular Biology, John Wiley & Sons, NY; 
Kriegler, 1990, Gene Transfer and Expression, A Laboratory Manual, Stockton Press, NY; and 
20 in Chapters 12 and 13, Dracopoli et aL, (eds.), 1994, Current Protocols inHuman Genetics, John 
Wiley & Sons, NY. Vectors suitable for gene therapy are described above. 

In one aspect, the therapeutic vector comprises a nucleic acid that expresses a 
protein of the invention in a suitable host In particular, such a vector has a promoter 
operationally linked to the coding sequence for the protein. The promoter can be inducible or 
25 constitutive and, optionally, tissue-specific. In another embodiment, a nucleic acid molecule is 
used in which the protein coding sequences and any other desired sequences are flanked by 
' regions that promote homologous recombination at a desired site in the genome, thus providing 

for intrachromosomal expression of the protein (KoUer and Smithies, Proc. Natl. Acad. Sci. 
U.S.A, 1989, 86:8932-8935; Zijlstra a/.. Nature, 1989, 342:435-438). 
' 30 Delivery of the vector into a patient may be either direct, in which case the patient 

I is directly exposed to the vector or a delivery complex, or indirect, in which case, cells are first 

i 
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transfoimed with the vector in vitro then transplanted into the patient. These two approaches are 
known, respectively, as in vivo and ex vivo gene therapy. 

In a specific embodunent, the vector is directly provided in vivo, where it enters 
the cells of the organism and mediates expression of the protein. This can be accomplished by 
any of numerous methods known in the art, by constructing it as part of an appropriate expression 
vector and administering it so that it becomes intracellular, e.g.. by infection using a defective 
or attenuated retroviral or other viral vector (see, U.S. Patent No. 4,980,286), or by direct 
injection of naked DNA, or by use of microparticle bombardment {e.g.. a gene gun; Biolistic, 
Dupont); or coating with lipids or cell-surface receptors or transfecting agents, encapsulation in 
biopolymers {e.g.. poly-S-l-64-N-acetylglucosamine polysaccharide; see, U.S. Patent No. 
5,63 5,493), encapsulation in liposomes, microparticles, or microcapsules; by administering it in 
Imkage to a peptide or other Ugand known to enter the nucleus; or by administering it in linkage 
to a ligand subject to receptor-mediated endocytosis (see, e.g.. Wu and Wu, J. Biol. Chem., 1 987, 
62:4429-4432), etc. In another embodiment, a nucleic acid ligand complex can be formed in 
which the ligand comprises a fusogenic vkal peptide to disrupt endosomes, allowing the nucleic 
acid to avoid lysosomal degradation. In yet another embodiment, the nucleic acid can be targeted 
in vivo for cell specific uptake and expression, by targeting a specific receptor (see, e.g. PCT 
Publication Nos. WO 92/06180, WO 92/22635, WO 92/20316 and WO 93/14188). 
Alternatively, the nucleic acid can be introduced intracellularly and incorporated within host ceU 
DNA for expression by homologous recombination (KoUer and Smithies, Proc. Natl. Acad. Sci. 
USA, 1989, 86:8932-8935; Zijlstra, et al.. Nature, 1989, 342:435-438). These methods are in 
addition to those discussed above in conjunction with "Viral and Non-viral Vectors" 

The form and amount of therapeutic nucleic acid envisioned for use depends on 
the type of disease and the severity of the desired effect, patient state, etc., and can be determined 
by one skilled in the art. 

Inhibition or stimulation of protein synthesis 
Gene transcription and protein translation may be inhibited or stimulated by 
administration of exogenous compounds. Exogenous compounds may interact witii 
extracellular and/or intracellular messenger systems, such as, but not limited to, adenosine 
triphosphate, nitiic oxide, and guanosine diphosphate; to regulate protein synthesis. In this 
embodiment, exogenous compounds that stimulate or inhibit H4 protein synthesis may be 
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used in the prevention and/or treatment for transplanted organ rejection, asthma, allergies, and 
autoimmune pathologies, such as, but not limited to, multiple sclerosis, type I diabetes, and 
rheumatoid arthritis. Additionally, CNS functions such as cognitive and memory defects may 
be treated. 

5 The present invention provides antisense nucleic acids (including ribozymes), 

which may be used to inhibit expression of H4 of the invention. The antisense nucleic acid, 
upon hybridizing under cytoplasmic conditions vnth complementary bases in an RNA or 
DNA molecule, inhibits the role of the RNA or DNA. Additionally, hybridization of the 
antisense nucleic acid to the DNA or RNA may inhibit transcription of the DNA into RNA 

10 and/or translation of the RNA into the protein. If the RNA is a messenger RNA transcript, 

the antisense nucleic acid is a countertranscript or mRNA-interfering complementaiy nucleic 
acid. Antisense nucleic acid molecules can be encoded by a recombinant gene for expression 
in a cell (e.g., U.S. Patent No. 5,814,500; U.S. Patent No. 5,81 1,234) or can be prepared 
synthetically (e.g., U.S. Patent No. 5,780,607). 

15 Alternatively, antibody molecules can also be administered, for example, by 

expressing nucleotide sequences encoding single-chain antibodies within the target cell 
population by utilizing, for example, techniques such as those described in Marasco et al 
(Proc. Natl. Acad Sci. USA, 1993, 90:7889-7893). 

Therapeutically suggested compounds may be provided to the patient in 

20 formulations that are known in the art and may include any pharmaceutically acceptable 
additives, such as excipents, lubricants, diluents, flavorants, colorants, and disintegrants. The 
fomiulations may be produced in useful dosage units such as tablet, caplet, capsule, liquid, or 
injection. 

The form and amount of therapeutic compound envisioned for use depends on the 
25 type of disease and the severity of the desired effect, patient state, etc., and can be determined by 
one skilled in the art. 

EXAMPLES 

The present invention will be better understood by reference to the following 
30 Examples, which are provided as exemplary of the invention, and not by way of limitation. 
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EXAMPLR T; CHARACTERIZATION OF THF. R, RECEPTOR 

A GPCR sequence profile was generated from a sequence alignment of members 
of this subfamily based on hidden Markov Models (Eddy Bioinformatics, 1998 14:755-763; 
Durbin et al. A tutorial introduction to hidden Markov models and other probabilistic modeling 
5 approaches in computational sequence analysis, Cambridge University Press, 1 998) to look for 
novel members in the human genome database. In detail, GPCR proteins of the biogenic amine 
subfamily were retrieved from Swiss-Prot database and a sequence alignment was generated by 
a multiple sequence alignment tool, named CLUSTALW (Thompson et al. Nucleic Acids Res, 
1 994, 22:4673-80). Using the HMMER program (Eddy, HMMER User's Guide and Program, 
10 Version 2.1, 1998), a consensus sequence (GNLLVILVIL RTKKLRTPTN IFILNLAVAD 
LLFLLTLPPW ALYYLVGGSE DWPFGSALCK LVTALDVVNM YASILLLTAI 
SIDRYLAIVH PLRYRRRRtS PRRAKVVILL VWVLALLLSL PPLLFSWVKT 

VEEGNGTLNVNVTVCLroFPEESTASVSTWLRSYVLLSTLVGFLLPLLVILVCYIRn.RT 
LRKAAKTLLV VVVVFVLCWL PYFIVLLLDT LCLSIIMSST CELERVLPTA 
LLVTLWLAYV NSCLNPIIY; SEQ ID NO: 3) was developed from the biogenic amine 
subfamily members. The consensus sequence contained the unique 7-transmembrane sequence 
structure of biogenic amine GPCRs. 

A weekly update of nucleotide sequence from GenBank database is maintained 
in-house. An auto-search script using TBLASTN program was written and the biogenic amine 
GPCR consensus sequence was used to search this database weekly. Every TBLASTN search 
result was carefully examined and potential open reading frame (ORF) fragments were extracted 
from nucleotide sequence. Each fragment was further verified to determine its novelty. 

Four peptide fragments translated from a recently released human genomic 
sequence (Accession number: AC007922) of chromosome 18 clone RP11-178F10, from the 
25 Whitehead Institute/MIT Center for Genome Research, were shown to have modest homology 
to differentregions of this GPCR consensus sequence. These four fragments located to different 
regionsof sequence AC009722 in both plus and minus strands. Interestingly, the highest scoring 
hit from searching the protein database with these peptides by BLASP was in all cases, the 
human histamine receptor 3 (H3). It is very likely that these four fragments are exons of a GPCR 
gene and their appearance in both sfrands of this genomic sequence may result from incorrect 
genomic contig assembly. 



20 



30 
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To clone the full length ORF, multiple primer sets were designed against the 
predicted sequence. Using multiple primer pairs, 3 overlapping pieces spanning the entire ORF 
were obtained from a human heart marathon cDNA library (Clontech, Palo Alto, CA). Using the 
marathon adapter primer API, and a reverse primer corresponding to ORF bases 644-615, a 
5 fragment of the H4 cDNA was obtained which corresponds to bases 276-644 of the H4 coding 
region (exon sequence) and 39 bases of intronic sequence at the 5* end. Primers comprising 
nucleotides 461-482 and 1 173-1 146 were used to PGR a 712bp band. The 5' sequence was 
obtained using primers comprising nucleotides 1-32 and 339-309. The outer primers (1-32, 
1 173-1146) were used to piece the 3 fragments together. The sequence was ligated into the 
10 mammalian expression vector pCDNA3. 1 + zeo (Invitrogen, Carlsbad, CA). 

EXAMPLE II; TISSUE EXPRESSION OF THE RECEPTOR 

Quantitative RT-PCR was performed on an ABI 7700 "Taqman" sequence 
detection system to determine the tissue distribution of the H4 receptor. Primers spaiming the 

15 exon 1-2 boundary (Forward primer: 5'-taacttggccattgacttctt-3'(SEQ ID NO:4); Reverse primer: 
5-attcgaacagcatgtgagggat-3' (SEQ ID NO:5) and a Probe: 5 -(6-carboxyfluorescein)- 
tacaaaggaatggagatcacacccaca- (6-carboxy-N,N,N'N'-tetramethyb:hodamine)-3' (SEQ ID NO:6)) 
were used to determine H4 expression levels in a mRNA prepared from a series of human tissues, 
purchased from Clontech (Palo Alto). A 2 step reaction procedure was performed as per 

20 manufacturers directions. Briefly, 2mg RNA was reverse transcribed using random hexamers 
(2.5mM in a final volume of 20ml. 14ul of this was used in the PGR reaction. The cyling 
conditions were as follows, 95 °C for 10 minutes, followed by 40 cycles of (a) 95 °C for 15 
seconds and (b) 60 °C for 1 minute. 

25 EXAMPLE III: PHARMACOLOGICAL PROFILE OF THE RECEPTOR IN YEAST 

CELLS 

The H4 receptor protein coding sequences were amplified using Forward oligo: 
5'-aaggatccaaaatgccagatactaatagc-3' (SEQ ID NO:7) and Reverse oligo: 5'- 
aagtcgacttaagaagatactgaccgac-S' (SEQ ID NO:8) that add BamHI and yeast consensus 
30 translational initiation sites to the 5' end and a Sail site to the 3* end. The fragment was cloned 
into corresponding sites in the multicopy yeast expression vector, p426GPD, thus placing 
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receptor expression under control of the strong constitutive GPDl promoter. The yeast 
expression plasmid, pMP327, was introduced into MPY578i5 cells (MATa ura3 his3 trpl leu2 
lys2 ade2 farl::LYS2 fusl::FUSl-fflS3 sst2::SST2-G418R ste2::LEU2, gpal::GPAli5) 
(Hadcock, J,R. and Pausch, M.H. submitted) using LiOAc and selected for ura prototrophy. In 
5 order to facilitate coupling of the receptor to the G protein, MPY578i5 cells express a chimeric 
G alpha protein coupled to the mating signal transduction pathway. The chimeric construct is 
expressed from the GPAl locus and is composed of Gpal sequences in which the 5'-C-terminal 
amino acids have been replaced with those of Gai3. A multicopy FUSl-LacZ reporter gene 
plasmid, pMP283 (Hadcock, J.R. and Pausch, M.H. submitted), was subsequently introduced into 

10 H4R-containing MPY578i5 cells and selected on media lacking trp and ura. The resulting yeast 
strain, MPY733, was used for further analysis. 

Samples (250 ng) of compounds present in the LOPAC panel (Sigma RBI, Natick 
MA), were dispensed to 96 well microtiter dishes. MPY733 cells (5 x lOVml, 200 ml/well) in 
assay medivmi (SCD-ura-trp, pH 6.8, 25 mM PIPES, O.lmg/ml Chlorophenylred 

15 bglactopyranoside(CPRG), 2 mM 3- AT) were added and cultured overnight at 30 °C. The 
presence of active compounds was detected the next day by measurement of absorbance at 570 
nm using a Wallac Victor H. The LOPAC panel was screened in dupUcate and in parallel with 
another yeast strain containing a different orphan GPCR. Only compounds that produced 
significantly elevated absorbance in both receptor containing plates and not in the other GPCR 

20 containing plates were deemed active. 

EXAMPLE IV: PHARMACOLOGICAL PROFILE OF THE H. RECEPTOR IN 
MAMMALIAN CELLS 

The ORF was modified by PGR for mammalian expression of H4. A 5' Hindm 

25 restriciton en2ymes site and a Kozak consensus sequence were added using the primer 5'- 
aagcttccaccatgccagatactaatagcacaatcaatttatc-3' (SEQ ID NO:9), and a 3' Xbal site added 5'- 
tctagattaagaagatactgaccgactgtgttg-3^ (SEQ ID NO: 1 0). The sequence was confirmed and Ugated 
into the Hindin and Xba I sites of the mammalian expression vector pCDNA3.1+ zeo 
(Invitrogen, Carlsbad CA), HEK 293 cells (approximately 1 0^ cells) were transfected with the 

30 pCDNA3. H-zeo/H4 using standard lipofectamine plus reagent (Life Technologies). Cells were 
maintained in DMEM containing 1 0% fetal calf serum and penicillin ( 1 00units/ml)/streptomycin 
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(lOOmg/ml). 48 hours post transfection cells were selected using 500mg/ml zeocin. Zeocin 
resistant clones were assayed using the cAMP assay and by RT-PCR. RNA was extracted from 
approximately 10* cells using One step PGR kit (Life technologies). The primers used in the 
extraction were 5'-ggaaggatgaaggtagtgaatg-3' (SEQ IDNO:l 1) and 5 -cagaatctgattgggaggaagg-3' 
5 (SEQ ID NO; 12). 

HEK cells stably expressing the H4 receptor were assessed functionally in cAMP 
assays using the cAMP scintillation proximity assay (SPA) (Amersham Pharmacia Biotech, 
Pistcataway, NJ). Briefly, 40 000 cells were plated into wells of a 96 well plate. 24 hours later 
the media was removed and replaced with lOOjiil Krebs bicarbonate buffer and the cells were 
10 incubated at 3TC for 15 minutes. Following this, the cells were incubated in Krebs buffer 
containing 0.5mM isobutylmethylxanthine, a phosphodiesterase inhibitor, to prevent cAMP 
breakdown. The effect of H4 on the forskolin induced formation was determined by incubating 
the cells in the presence of forskolin (lOjiM) and agonist for 12 minutes. cAMP levels were 
determined using the cAMP SPA kit according to the manufacturers directions. 

15 

EXAMPLE V: H.^ EXPRESSION IN INBUCED CD4 AND CDS CLONES 

This examination was imdertaken using the TaqMan EZ RT PGR kit and the 

following oligonucleotides (provided by Philip Jones at Wyeth Neuroscience) (i) H4 EX IF 

(forward oligo): 5'-taacttggccatctttgac-3' (SEQ ID NO:13), (ii) H4 EX IR (reverse primer): 5'- 
20 attcgaacagcgtgtgag-3* (SEQ ID NO:14), and H4 EX 1 Probe: 5'-(6-carboxyfluorescein)- 

tacaaaggaatggagatca-3' (SEQ ID NO: 15), RNA for the standard curve was polyA+ human 

leukocyte RNA from Clontech. 

50 nanograms of total RNA was assayed in duplicate from 6 human T cell clones 

derived from a single human donor. Three of these clones were CD4+, two were CD 8+, and one 
25 was an NK T cell clone. These T cell clones were stimulated with anti-CD3 and RNA was 

isolated at 0, 2, 4, 8, 24 and 48 hoiirs after stimulation. Expression was normalized with 

GAPDH. 

In order to determine whether the pattern of H4 expression was specific to clonal 
populations of human lymphocytes, the same TaqMan quantitative PGR assay on 10 nanograms 
30 of RNA from bulk populations of CD4+ and CD8+ T cells isolated from three different 
individuals. These lymphocytes were stimulated with anti-CD3 and harvested at the same time 
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points listed above. H4 expression was normalized with GAPDH. 

RESULTS 

Using a GPCR sequence profile generated from a Hidden Markov Model (HMM) 
5 of the biogenic amine subfamily, several sequences with homology to the human H3 receptor 
were identified from the human genomic sequence of chromosome 18 (clone PJPU- 178F10 
Accession number: AC007922 from the Whitehead Institute/MIT Center for Genome Research) 
(Eddy Bioinformatics, 1998, 14:755-763). A contig of the predicted exons forms a sequence 
encoding a putative GPCR whose nucleotide sequence and translated peptide sequence are shown 

10 in Figure 1 and SEQ ID NOSrl and 2. The sequence was deposited with Genbank (Accession 
Number AF307973), 

The predicted protein sequence contains residues that are conserved across the 
biogenic amine receptors. As in other biogenic amine receptors, the present protein contains 
conserved aspartate residues within transmembrane (TM) domains 2 and 3 (at positions 61 and 

15 94). A DY (D94 and Y95) motif in TM3 is foimd only in histamine and muscarinic receptors. 
A DRY sequence motif that exists at the cytoplasmic interface of TM3 of all GPCRs also can be 
found in the H4 sequence (amino acids 111- 113). Also present are two conserved cysteine 
residues in the first and second extracellular loops (position 88 and 164), which are predicted 
to form a disulfide bond; two conserved tryptophan residues in TM4 and TM6 (positions 140 and 

20 316); and conserved proluae residues in TM5, TM6, and TM7 (positions 186, 318, and 355, 
respectively). Based on these results, H4 cDNA appears to encode a biogenic amine-like 
receptor. 

The predicted open reading frame is 1 173 base pairs long and encodes a protein 
of 390 amino acids. Sequence comparison using BLAST?, under standard conditions, analysis 
25 reveals that the novel protein is most similar to the human H3 receptor (44% identical and 5 1 % 
similar). Based on sequence homology it is proposed that the receptor belongs to the histamine 
receptor family, therefore we have termed it the H4 receptor. 

Receptor distribution studies indicate that the present receptor is highly expressed 
in peripheral blood leukocytes. Trace amounts of the H4 receptor are expressed in heart, lung and 
30 placenta. It is proposed that these trace amounts of expression represent blood cell mRNA 
present in the samples. 
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Functional studies furtlier confirmed the categorization of the protein as a novel 
histamine receptor. Histamine and the H3 selective agonist R-a-methylhistamine stimulated 
cAMP accumulation, suggesting these compoimds are agonists at the H4 receptor (See Figure 
2A). R-a-methylhistamine exhibited both lower potency and efficacy than histamine at the H4 
5 receptor despite being more potent at the related cloned H3 receptor and in several tissue based 
assays for the H3 receptor (Vollinga, et al, J. Med. Chem, 1 995, 38:266-27 1 ; Harper, et al , Br. 
J. Pharmacol. 1999, 128:751-759). The selective H3 receptor antagonist clobenpropit was a 
partial agonist at the H4 receptor (See Figure 2A). 

The H3 antagonist thioperamide almost fully inhibited the stimulatory response 

10 produced by histamine (See Figure 2B). Comparatively, clobenpropit partially blocked 
histamine-induced stimulation of cAMP formation, further suggesting that clobenpropit is a 
partial agonist at the H4 receptor. The interaction of the H4 receptor to a GP AI/ Gai3 chimeric 
G-protein alpha subunit also predicts its coupling specificity in manMnalian cells. 

Expression of the H4 receptor in HEK 293tsa cells confirms flie coupling of the 

15 H4 receptor to the inhibition of cAMP formation (See Figure 2C). In these cells, forskolin 
stimulated 8-fold greater cAMP fonnation compared to basal levels. Addition of histamine (1 
|iM) inhibited forskolin-induced stimulation of cAMP accumulation by about 40%. These 
studies suggest that the H4 receptor couples an inhibitory G-protein to inhibit adenylyl cyclase 
activity and cAMP accumulation. 

20 Structure-activity relationship studies were conducted with several histamine 

antagonists to further define the pharmacological profile of this receptor. Studies were conducted 
to correlate the effect of the alkyl chain to antagonist activity at the histamine H4 receptor (See 
Figure 3). The rank order of efficacy obtained for the human H4 was (number represents the 
length of the alkyl chain) 5>6= 1 0>8=4. This rank order of efficacy contrasts with the htmian 

25 H3 receptor where the rank order is 5>4>6>8>1 0. 

Prior studies have indicated the expression of at least two subtypes of the H3 
receptor. These two potential H3 subtypes, localized in the rat brain and guinea pig jejenum, have 
been shown to have rank order of efficiencies of 4=5>3>6>8 and 5>6=4>8>3, respectively. 
These assays were different (rat was a radioligand binding assay and guinea pig was an organ 

30 bath experiment using isolated guinea pig ileum). However, the relative potencies of the series 
can be compared, so its possible to say that as the rank order differs then the receptors are likely 
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to be different (Leurs et al J. Pharm. Exp. Ther, 1996, 276:1009-1015.) 

The present studies also indicate that thioperamide is more efficacious and potent 
than impentamine at the H4 receptor. Comparatively, impentamine has greater affinity for cloned 
histamine H3 receptors (4-fold) than thioperamide (Ki=50.8nM and 193nM respectively). 

H4 expression was consistently detected in all three CD4+ clones at most time 
points (Figure 4). There was no consistent temporal pattern to anti-CD3 induction of expression 
between these three clones. One CD8+ clone showed a very low but detectable level of 
expression at most time points, while the other as well as the NKT clone had no detectable H4 
expression despite good GAPDH amplification. 

H4 expression, normalized with GAPDH, was detectable in all samples at all time 
points (Figure 5). Anti-CD3 stimulation produced significant induction of H4 expression in all 
6 samples, most peaking at the 8 hour time point The highest level of induction was found in 
the 8 hour CD4+ and CD8-h samples fi-om the same donor (designated *U'), peaking at greater 
than 1 00 copies of H4 RNA per copy of GAPDH. 

Combined, these studies indicate that the pharmacological profile of the H4 
receptor is not similar to any known histamine H3 receptor. 

* * * 

The present invention is not to be limited in scope by the specific embodiments 
described herein. Indeed, various modifications of the invention in addition to those described 
herein will become apparent to those skilled in the art ftom the foregoing description and the 
accompanying figures. Such modifications are intended to fall within the scope of the appended 
claims. 

It is further to be understood that values are approximate, and are provided for 

description. 

Patents, patent applications, publications, procedures, and the like are cited 
throughout this application, the disclosures of which are incorporated herein by reference in their 
entireties. 
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1 1 . An isolated H4 receptor protein having an anndno acid sequence at least 

2 51% identical or comprising at least 10 contiguous amino acids from the sequence depicted in 

3 SEQ ID NO:2 which H4 receptor protein binds ligands comprising an imidazole and an amine, 

4 which imidazole and amine are attached by an alkyi chain, wherein the rank order of efficacy of 

5 modulation second messenger formation of the ligands at the H4 receptor protein is 5>6= 1 0>8=4, 

6 where the number represents the number of carbons in the alkyl chain. 

1 2. The H4 receptor protein of claim 1 , wherein upon binding histamine or a 

2 histamine agonist the receptor protein inhibits second messenger formation. 

1 3. The H4 receptor protein of claim 2, wherein the second messenger is 

2 cAMP. 

1 4. The H4 receptor protein of claim 1 which is a human H4 receptor protein. 

1 5. The H4 receptor protein of claim 4 which has an amino acid sequence £LS 

2 depicted in SEQ ID NO: 2. 

1 6. The H4 receptor protein of claim 4 which is encoded by a nucleic acid 

2 having a sequence as depicted in SEQ ID NO: 1 . 

1 7. An isolated H4 receptor protein having an amino acid sequence vsdth at 

2 least 95% sequence identity to human H4 receptor protein having an amino acid sequence as 

3 depicted in SEQ ID NO: 2. 

1 8, An isolated nucleic acid encoding the H4 receptor protein of cledm 1 or 7, 

1 9. The nucleic acid of claim 8 which is a cDNA. 

o 
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^ '^^^""'^^^ic acid ofclaim 8, wherein the H4 receptor protein is a human 

2 H4 receptor protein. 

^ ^^®^a*e^ nucleic acid encoding an H4 receptor protein, which nucleic 

2 acid hybridizes under stringent conditions to a nucleic acid having a sequence of at least 20 

3 nucleotides identical to a corresponding nucleotide sequence of the same number of bases in SEQ 

4 ID NO: 1 or its complement. 

^ ""^^"^clei^^ acid ofclaim 8, which encodes an H4receptor protein having 

2 an amino acid sequence as depicted in SEQ ID NO:2. 

1 13. The nucleic acid of claun 12, v^ch comprises a nucleotide sequence as 

2 depicted in SEQ ID NO: 1. 

* ^"^^ ^^^<^0''<=oniprising the nucleic acid ofclaim 8 operably associated with 

2 an expression control sequence. 

^ 15. A host cell transfected with the vector of claim 14. 

^ -^^^^-^^an animal transformed with the vector ofclaim 14, wherein the 

2 animal expresses a H4 receptor protein at a detectable level, whereby the cells expressing the H4 

3 receptor protein suppress CAMP formation when contacted with an H4 receptor agonist. 

^ producing a H, receptor protein, which method comprises 

2 culturing host cells of claim 15 under conditions that provide for expression of the receptor 

3 protein by the vector. 

1 18. An isolated nucleic acid ofat least ten bases having a nucleotide sequence 

2 identical to a corresponding nucleotide sequence of the same number of bases in SEQ ID NO: 

3 I or its complement. 
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1 19. The nucleic acid of claim 18 which is detectably labeled. 

1 • 20. An antibody that specifically binds to the H4 receptor protein of claim 1 

2 or 7. 

1 2 1 . A method for detecting an H4 receptor protein, which method comprises 

2 detecting binding of the antibody of claim 20 to a protein in a sample suspected of containing a 

3 H4 receptor protein, wherein the antibody is contacted with the sample under conditions that 

4 permit specific binding with any H4 receptor protein present in the sample. 

1 22. A method for detecting expression of H4 receptor, which method 

2 comprises detecting mRNA encoding H4 receptor in a sample from a cell suspected of expressing 
H4 receptor. 

1 23 . The method according to claim 22 wherein mRNA encoding H4 receptor 

2 is detected by hybridization to a H4 receptor-specific nucleic acid. 

1 24. The method according to claim 23 wherein the H4 receptor-specific nucleic 

2 acid is at least 10 nucleotides in length and has a sequence identical to a sequence of the same 

3 number of bases in SEQ ID NO: 1 , or the complementary sequence thereof. 

1 25, An assay system for identifying H4 receptor ligands, comprising a 

2 sufficient number of cells of claim 1 5 to detect an alteration in second messenger accimiulation. 

1 26. The assay system of claim 25, wherein the second messenger is cAMP. 

1 27. The assay system of claim 25, wherein the receptor is a himian receptor. 

1 28 . A method for identifying a test compound that antagonizes histamine H4 

2 receptors, which method comprises detecting an increase in the level of a second messenger in 

3 an assay system of claim 25 contacted with the test compound, wherein an increase in the level 

4 of the second messenger indicates that the test compound antagonizes the H4 receptor, 
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1 29, A method for identifying a test compound that agonizes histamine H4 

2 receptors, which method comprises detecting a decrease in the level of a second messenger in 

3 an assay system of claim 25 contacted with the test compound, wherein the decrease in the level 

4 of the second messenger indicates that the test compound agonizes the H4 receptor. 

1 30. A method for identifying a compound that binds an H4 receptor, which 

2 method comprises detecting binding of a test compoimd to the H4 receptor protein of claim 1 . 

1 31. The method according to claim 29, wherein binding of the test compound 

2 is detecting by inhibiting binding of a labeled H4 ligand. 

1 32. The method according to claim 29, wherein the H4 receptor protein is in 

2 a lipid bilayer membrane, 

1 33 . An isolated nucleic acid that specifically hybridizes under highly stringent 

2 conditions to the complement of the sequence depicted in SEQ ID NO : 1 , wherein said nucleic 

3 acid encodes a histamine H4 receptor protein. 
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FIG. I A 



( I ) ATGCCAGATACTAATACCACAATCAATTTATCACTAAGCACTCGTGTTACnTACC^ 

MPDTNSTI NLSLSTRVTLAF 



TTT ATOTCCT T ACT AGCTTTTOCT AT AATGCT AGGAAATGCTTTGGTCATTTT aGCTTTT 
FMSLVAFAI MLGNALVI LAP 



( \ 2\ ) GTGGTGGACAAAAACCTTACL\CAT(XiAAGT AGITATA 

VVDKNLRHRSSYFFLNLAl S 



GACTTCrTTGTGGGrOTGATCrCCATT(XTTTOTACATC^ 
DFF VGVI SI PLY I PHTLFEW 



GATTTTGGAAAGGAAATCTGTGrATTTTGOCTCACTACTGACTATCTGTTATGT A 
DFGKEl CVF WLTTDYLLCTA 



TCTGTATATAACATTGTCCrCATCAGCTATGATCGATACCTGTCAGTCTCAAATGCT 
SVYNl VLI SYDRYLSVSNAV 



(361) AGTTATAGAACTCAACATACTCXjOGTCTTGAAGATTGTTACT 

SYRTQHTGVLKI VTLMVAVW 



GTOrrGGCCTTCTTAGTGAATGGCXXZAATGATTCTAGT 
VLAFLVNGPMl LVSESWKDE 



(481) GGTAGrGAATGTGA\(XTGGATTTTTTTCa3^ 

GSECEPGFFSEWYI LAI TSF 



TTGGAATTa7rGATCCXL\GrC\TCTnrAGrO^^ 
LEFVl P-VI LVAYFNMNI YWS 



A _ : A 
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FIG. IB 

A ■■ A 

(60 1) CrGTGGAAGCGrGATCATCrCAGrAGGTGCC\AAGCp^T(XTGG^CT 

L W KRDHLS RCQS HP GLTAVS 

tCX:AACATCTGrGGACACTCATT(>.GAGGTAGACTATCTT<^ 
SNI CGHSFRGRLSSRRSLSA 

(721) TCXiACAGAAGTTCXnrGCATCXTTTCATTCAGAGAGACAG^ 

STEVPASFHSERQRRKSSLM 

TTTTOCTCAAGAAO^AAGATGAATAGCAATACAATTC^^ 
FSSRTKMNSNTI ASK M G S F S 

(841) CAATCAGATTCTGTAGCTCTTCWXAAAGOGAACATGITGA^ 

QS DSVALHQREHVELLRA RR 



TTAGOCAAiCn-CACTGaOCATTCrCTTAGaGGTTT^ 
LAKSLAI LLGVFA VCWAP YS 



(961) CTGTTCACAATTGTCCTTTCATTTTATTarrCAGCAAC^^ 

LFTI VLSFYSSATGP KSVWY 



AGAATTGCATTTTGGCITCACTGarrCAATTCCT^ 
RI AFWLQWFNSFVNPLLYPL 



( J 0 8 1 ) TGTCACAAGCGCmCAAAAaxnTTCrrGAAAATATT^ 

CHKRFQKAFLKI FCI KKQP L 

( U •« t ) OCATCACAACACAGTOGCTrCAGrAT Ct ICl 1 AA 
PSQHSRSVSS 
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FIG. 2A 
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FIG. 2C 



0.0 



basal 



forsk 



his 



BNSDOCID: <WO_0l8S7e6A3JA> 



SUBSTITUTE SHEET (RULE 26) 



wo 01/085786 



PCT/USOl/14527 



5/7 




018S786A3JA: 



SUBSTITUTE SHEET (RULE 26) 



wo 01/085786 



PCT/USOl/14527 



6/7 



CD < 
CO CO 



O < 



3-^ 00 00 T- 

O O CD CD CD 

q: q: ir £ £ a: 

^ ^ m D M M 



S2 

Ll 



Jdl 



r 

o 



"T 



(D 



-r 



t: 



co 



I 

CM 



00 



o 

CO E 



CO 
CO 
O 

o 



C3 

o 

X 



CM 



(Bu) VN^ +vA|Od ^na-inH sjueiBAmbg 



BNSOOCID: <Vra_01857e6A3JA> 



SUBSTITUTE SHEET (RULE 26) 



wo 01/085786 PCT/USOl/14527 



7/7 



00 


CO CO 








Q 


Q O 


Q 


Q 


Q 


O 


o o 


O 


O 


O 


1 




I 


1 






0 0 


m 








I 



in 



tL. 



— E 





7- 

O 
CO 



-r 
o 



o 

CM 



1^ 
o 
o 



o 

00 



"T" 

o 



O 



00 



Si 



<M 



00 



O O 
CM 



. c 
o 

m 

D 

E 

CO 

Q 
O 

c 

w 

u 

. 3 
O 
X 



BNSOOCID: <WO QIBSTBSAa IA> 



SUBSTITUTE SHEET (RULE 26) 



wo 01/085786 



PCT/USOl/14527 
■> 



SEQUENCE LISTING 



<110> Jones, Philip 
Shujian, Wu 
Blatcher, Maria 
Pausch, Mark 



<120> 


Hioman Histamine H4 Receptor 








<130> 


3655/1J100-US3 












<140> 
<141> 


TBA 

Concurrently Herewith 










<150> 
<151> 


60/202, 151 
2000-05-05 












<150> 
<151> 


60/227, 567 
2000-08-23 












<150> 
<151> 


60/247, 855 
2000-11-13 












<160> 


15 














<170> 


Patentin version 3.0 










<210> 
<211> 
<212> 
<213> 


1 

1173 

DNA 

Homo sapiens 












<400> 1 
atgccagata 


ctaatagcac 


aatcaattta 


tcactaagca 


ctcgtgttac 


tttagcattt 


60 


tttatgtcct 


tagtagcttt 


tgctataatg 


ctaggaaatg 


ctttggtcat 


tttagctttt 


120 


gtggtggaca 


aaaaccttag 


acatcgaagt 


aguTia u u tlxi u 


ttcttaactt 


ggccatctct 


180 


gacttctttg 


tgggtgtgat 


ctccattcct 


ttgtacatcc 


ctcacacgct 


gttcgaatgg 


240 


gattttggaa 


aggaaatctg 


tgtattttgg 


ctcactactg 


actatctgtt 


atgtacagca 


300 


tctgtatata 


acattgtcct 


catcagctat 


gatcgatacc 


tgtcagtctc 


aaatgctgta 


360 


agttatagaa 


ctcaacatac 


tggggtcttg 


aagattgtta 


ctctgatggt 


ggccgtttgg 


420 


gtgctggcct 


tcttagtgaa 


tgggccaatg 


attctagttt 


cagagtcttg 


gaaggatgaa 


480 


ggtagtgaat 


gtgaacctgg 


atttttttcg 


gaatggtaca 


tccttgccat 


cacatcattc 


540 


ttggaattcg 


tgatcccagt 


catcttagtc 


gcttatttca 


acatgaatat 


ttattggagc 


600 


ctgtggaagc 


gtgatcatct 


cagtaggtgc 


caaagccatc 


ctggactgac 


tgctgtctct 


660 
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tccaacatct 


gtggacactc 


attcagaggt 


agactatctt 


caaggagatc 


tctttctgca 


720 


tcgacagaag 


ttcctgcatc 


ctttcattca 


gagagacaga 


ggagaaagag 


tagtctcatg 


780 


ttttcctcaa 


gaaccaagat 


gaatagcaat 


acaattgctt 


ccaaaatggg 


ttccttctcc 


840 


caatcagatt 


ctgtagctct 


tcaccaaagg 


gaacatgttg 


aactgctcag 


agccaggaga 


900 


ttagccaagt 


cactggccat 


tctcttaggg 


gtttttgctg 


tttgctgggc 


tccatattct 


960 


ctgttcacaa 


ttgtcctttc 


attttattcc 


tcagcaacag 


gtcctaaatc 


agtttggtat 


1020 


agaattgcat 


tttggcttca 


gtggttcaat 


tcctttgtca 


atcctctttt 


gtatccattg 


1080 


tgtcacaagc 


gctttcaaaa 


ggctttcttg 


aaaatatttt 


gtataaaaaa 


gcaacctcta 


1140 


ccatcacaac 


acagtcggtc 


agtatcttct 


taa 






1173 



<210> 2 

<211> 390 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Pro Asp Thr Asn 
1 5 

Thr Leu Ala Phe Phe 
20 

Asn Ala Leu Val He 
35 

Arg Ser Ser Tyr Phe 
50 

Gly Val He Ser He 
65 



Asp Phe Gly Lys Glu 

85 

Leu Cys Thr Ala Ser 
100 



Tyr Leu Ser Val Ser 
115 

Val Leu Lys He Val 
130 

Leu Val Asn Gly Pro 



Ser Thr He Asn Leu 

10 

Met Ser Leu Val Ala 
25 

Leu Ala Phe Val Val 
40 

Phe Leu Asn Leu Ala 
55 



Pro Leu Tyr He Pro 
70 

He Cys Val Phe Trp 

90 

Val Tyr Asn He Val 
105 

Asn Ala Val Ser Tyr 
120 

Thr Leu Met Val Ala 
135 

Met He Leu Val Ser 



Ser Leu Ser Thr Arg Val 

15 

Phe Ala He Met Leu Gly 
30 

Asp Lys Asn Leu Arg His 
45 



He Ser Asp Phe Phe Val 
60 

His Thr Leu Phe Glu Trp 
75 80 

Leu Thr Thr Asp Tyr Leu 

95 

Leu He Ser Tyr Asp Arg 
110 

Arg Thr Gin His Thr Gly 
125 

Val Trp Val Leu Ala Phe 
140 

Glu Ser Trp Lys Asp Glu 
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145 150 155 160 

Gly Ser Glu Cys Glu Pro Gly Phe Phe Ser Glu Trp Tyr lie Leu Ala 

165 170 175 

lie Thr Ser Phe Leu Glu Phe Val lie Pro Val lie Leu Val Ala Tyr 
180 185 190 

Phe Asn Met Asn lie Tyr Trp Ser Leu Trp Lys Arg Asp His Leu Ser 
195 200 205 

Arg Cys Gin Ser His Pro Gly Leu Thr Ala Val Ser Ser Asn lie Cys 
210 215 220 

Gly His Ser Phe Arg Gly Arg Leu Ser Ser Arg Arg Ser Leu Ser Ala 
225 230 235 240 

Ser Thr Glu Val Pro Ala Ser Phe His Ser Glu Arg Gin Arg Arg Lys 

245 250 255 

Ser Ser Leu Met Phe Ser Ser Arg Thr Lys Met Asn Ser Asn Thr lie 
260 265 270 

Ala Ser Lys Met Gly Ser Phe Ser Gin Ser Asp Ser Val Ala Leu His 
275 280 285 

Gin Arg Glu His Val Glu Leu Leu Arg Ala Arg Arg Leu Ala Lys Ser 
290 295 300 

Leu Ala lie Leu Leu Gly Val Phe Ala Val Cys Trp Ala Pro Tyr Ser 
305 .310 315 320 

Leu Phe Thr lie Val Leu Ser Phe Tyr Ser Ser Ala Thr Gly Pro Lys 

325 330 335 

Ser Val Trp Tyr Arg lie Ala Phe Trp Leu Gin Trp Phe Asn Ser Phe 
340 345 350 

Val Asn Pro Leu Leu Tyr Pro Leu Cys His Lys Arg Phe Gin Lys Ala 
355 360 365 

Phe Leu Lys lie Phe Cys lie Lys Lys Gin Pro Leu Pro Ser Gin His 
370 375 380 

Ser Arg Ser Val Ser Ser 
385 390 

<210> 3 

<2li> 259 

<212> PRT 

<213> Artificial 

<220> 

<223> Consensus secfuence 
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<400> 3 

Gly Asn Leu Leu Val lie Leu Val lie Leu Arg Thr Lys Lys Leu Arg 
1 5 10 15 

Thr Pro Thr Asn lie Phe lie Leu Asn Leu Ala Val Ala Asp Leu Leu 
20 25 30 

Phe Leu Leu Thr Leu Pro Pro Trp Ala Leu Tyr Tyr Leu Val Gly Gly 
35 40 45 

Ser Glu Asp Trp Pro Phe Gly Ser Ala Leu Cys Lys Leu Val Thr Ala 
50 55 60 

Leu Asp Val Val Asn Met Tyr Ala Ser lie Leu Leu Leu Thr Ala lie 
65 70 75 80 

Ser lie Asp Arg Tyr Leu Ala lie Val His Pro Leu Arg Tyr Arg Arg 

85 90 95 

Arg Arg Thr Ser Pro Arg Arg Ala Lys Val Val lie Leu Leu Val Trp 
100 105 110 

Val Leu Ala Leu Leu Leu Ser Leu Pro Pro Leu Leu Phe Ser Trp Val 
115 120 125 

Lys Thr Val Glu Glu Gly Asn Gly Thr Leu Asn Val Asn Val Thr Val 
130 135 140 

Cys Leu lie Asp Phe Pro Glu Glu Ser Thr Ala Ser Val Ser Thr Trp 
145 150 155 160 

Leu Arg Ser Tyr Val Leu Leu Ser Thr Leu Val Gly Phe Leu Leu Pro 

165 170 175 

Leu Leu Val lie Leu Val Cys Tyr Thr Arg lie Leu Arg Thr Leu Arg 
180 185 190 

Lys Ala Ala Lys Thr Leu Leu Val Val Val Val Val Phe Val Leu Cys 
195 200 205 

Trp Leu Pro Tyr Phe lie Val Leu Leu Leu Asp Thr Leu Cys Leu Ser 
210 215 220 

lie lie Met Ser Ser Thr Cys Glu Leu Glu Arg Val Leu Pro Thr Ala 
225 230 235 240 

Leu Leu Val Thr Leu Trp Leu Ala Tyr Val Asn Ser Cys Leu Asn Pro 

245 250 255 

He He Tyr 
<210> 4 
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<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Forward primer 
<400> 4 

taacttggcc attgacttct t 



<210> 5 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Reverse Primer 

<400> 5 

attcgaacag catgtgaggg at 



<210> 6 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Probe 
<220> 

<221> modif ied_base 

<222> (1) . . (1) 

<223> (6-carboxyfluorescein) -t 



<220> 

<221> modif ied_base 

<222> (27) . . (27) 

<223> a- {6-carboxy-N,N,N'N'-tetramethylrhodamine) 

<400> 6 

tacaaaggaa tggagatcac acccaca 

<210> 7 

<211> 29 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
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<400> 7 

aaggatccaa aatgccagat actaa'tagc 

<210> 8 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 
<400> 8 

aagtcgactt aagaagatac tgaccgac 

<210> 9 

<211> 43 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 
<400> 9 

aagcttccac catgccagat actaatagca caatcaattt ate 



<210> 


10 


<211> 


33 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Primer 


<400> 


10 



tctagattaa gaagatactg accgactgtg ttg 



<210> 11 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 11 

ggaaggatga aggtagtgaa tg 

<210> 12 

<211> 22 
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<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Primer 


<400> 


12 


cagaatctga ttgggaggaa gg 


<210> 


13 


<211> 


19 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Oligonucleotide 


<400> 


13 


taacttggcc atctttgac 


<210> 


14 


<211> 


18 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Oligonucleotide 


<400> 


14 


attcgaacag cgtgtgag 


<210> 


15 


<211> 


19 


<212> 


DNA 


<213> 


Artificial 


<220> 




<223> 


Oligonucleotide 


<220> 




<221> 


modified base 


<222> 


(1) . . (1) 


<223> 


5 ' - ( 6-carboxyf luori 


<400> 


15 



tacaaaggaa tggagatca 



PCTAJSOl/14527 



22 



19 



18 
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